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foreword 


’his is the first of two special issues on the important topic of competitive manufacturing 
ystems. This topic is motivated by the heightened importance of the manufacturing domain 

I the wake of intense global competition and tremendous advances in technology. So, 
/hen Professor Viswanadham suggested in March 1996 that we bring out a Special Issue 
f Sddhana on this topic, we readily agreed. 

''competitive Manufacturing Systems 

i competitive manufacturing system designs, produces, and delivers high-quality, 
ustomer-desired products faster than competition, in a dynamic and uncertain environ- 
aent. World-wide, competition is now very intense and customers have exacting demands 
if low cost, low defect rates, high performance, and easy maintenance. Product life cycles 
re shrinking rapidly and new technology innovations are bringing about dramatic changes 
a the manufacturing domain. The basis for competing with other challengers has gradually 
hifted from cost and quality to time and core-competence. A world-class manufacturing 
nterprise should be able to perform at a high level with agility in the face of such dynamic 
nd uncertain factors sweeping the manufacturing world. Researchers and practitioners 
lave identified several core issues for the effective design, management, and continuous 
mprovement of such high-performance manufacturing systems. These include: Flexible 
utomation, business process management, flexibility management, systems integration 
nd strategic networking. 

kope of the Special Issue 

lince the success of a manufacturing enterprise depends not merely on the effectiveness 
>f the factory floor operations but more critically on elements such as suppliers, logistics 
letwork, distributors, and new product designs, the scope of the special issue encompasses 

II these elements of a manufacturing enterprise. Also, the traditional “functional” view of 
n enterprise as a sequential arrangement of functions such as marketing, finance, R & D, 
nd manufacturing is giving way to a more logical “process” view that regards an enter- 
>rise as a conglomerate of value-delivering business processes. These processes cut across 
Qultiple functional domains and emphasize how the organization delivers value to the 
nstomers. Important business processes include: Factory floor process, new product de¬ 
velopment process, plant maintenance process, supply chain process, and order-to-dehvery 
)rocess. The scope of this special issue includes business process modelling, analysis, and 
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management. In addition to the above “cosmic” and “process” view of a manufacturing 
organization, the special issue also focuses on important traditional topics such as quality, 
flexibility, modelling and simulation, scheduling etc. 

Overview of the First Special Issue 

We now give an overview of the seven papers appearing in this, the first special issue. 

The first paper is by Mathur & Pattipati and is on the topic of off-line quality engineer¬ 
ing methods, which have aroused widespread interest following Taguchi’s robust design 
methodology. Quality engineering is an important methodology for controlling, maintain¬ 
ing, and improving product quality. It addresses two main types of problems: On-line 
quality control and off-line design of products and processes for quality. In this paper, 
Mathur & Pattipati provide an excellent overview of the evolution of off-line quality en¬ 
gineering methods with single or multiple quality criteria. Since most design-for-quality 
problems involve multiple quality criteria, the authors emphasize various methods avail¬ 
able for multi-objective optimization techniques for robust parameter design. This article 
is rich with examples and provides a clear overview of the problems, methodologies, and 
recent results. 

In the second paper, authors Ramesh Srinivasan & Jayashankar Swaminathan treat an 
important problem of planning and coordination that results when manufacturers strive 
to offer broader product lines to achieve greater customer satisfaction. Focusing on the 
computer industry, the authors study the increase in the complexity of the manufacturing 
process and in the complexity of forecasting, parts planning, final assembly, and delivery 
of products entailed by the effort to offer more variety in product lines. In particular, 
the authors consider a feature-based product line in the computer industry and highlight 
issues related to forecasting, parts planning, final assembly, and inter-plant coordination. 
Techniques adopted to overcome the challenges are also described. 

The third paper, by Sridhar Tayur, provides an overview of the recent developments 
in the single product, discrete-time, capacitated production-inventory models. The model 
is motivated by typical supply chain design problem in a diverse set of companies. The 
author brings out the interaction between demand variability, available production capacity, 
inventory holding costs, lead times, and desired service levels. Several significant advances 
have occurred in the study of such models in the last few years and the author has provided 
a clear overview of the problems and recent techniques. 

The next two papers are on scheduling. The first one is concerned with stochastic 
scheduling with setups in a single flexible machine, and the second one with the heuristic, 
deterministic scheduling of an assembly job shop. Narahari & Hemachandra focus on a 
flexible machine serving multiple classes of products, with a setup or switch-over time 
and/or cost involved whenever the machine switches to a different product type. Choosing 
the total cost as the sum of discounted inventory and setup costs over an infinite horizon, pj 
they prove the optimality of exhaustive service policies for a fairly general arrival pattern of 
jobs. Their result generalizes the earlier work of many researchers who were constrained to 
assume independent Poisson inputs. The arrivals considered by Narahari & Hemachandra 
are more general; in particular, correlated in the Markov modulated sense. The second 
paper on scheduling is by Awate & Saraph. This paper addresses the issue of scheduling 
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assembly job shops in contrast to the more popular job-shop scheduling. It advocates 
: interesting look-back approach to take into account the staging delays suffered by 
>erationally late jobs. Their scheduling method generalizes several existing approaches 
d outperforms well-known heuristics. 

The sixth paper in this issue is by Raviraju, Reddy & Krishnaiah Chetty. This paper 
[dresses an important class of modem manufacturing systems, namely Just-in-Time pro- 
iction systems, and makes a case for modelling them using priority nets. The priority net 
odels are simulated to gain insights into many performance issues and the sensitivity of 
:rformance to important input parameters. 

Developments in software technology have been at the heart of many advances in the 
anufacturing domain. Manufacturing planning and control software belongs to the realm 
software in the large. The development of large-scale and complex software products, 
ch as in the manufacturing world, is a complex and expensive process. Sarma & Vijay 
lo, in the last paper of this issue, present queueing models for the software testing 
ocess in such a software product development environment. The models are based on 
e re-entrant line model, which itself was proposed in the first place to model re-entrant 
anufacturing systems such as semiconductor fabs and thin film lines. 

We would like to acknowledge the continuous support and ideas from Professor 
iswanadham. The authors have been most co-operative and have taken minimal time with 
eir papers. All the reviewers have been very prompt and thorough in their reviews. It is 
ostly because of this wonderful, quick response from the authors and the reviewers that 
is special issue has seen the light in exactly one year’s time (from conception to market), 
le Academy staff, coordinated efficiently by Ms. Shashikala, have also contributed sig- 
ficantly to the compression of this issue’s lead time. We would also like to acknowledge 
e support provided by our respective departments: Computer Science and Automation 
the Indian Institute of Science; School of Industrial Engineering and Management at 
e Oklahoma State University; and the Kellogg Graduate School of Management at the 
orthwestem University. 


ibruary 1997 Y NARAHARI 

M KAMATH 
RAVI ANUPINDI 
Guest Editors 



Single- and multiobjective optimization problems in robust 
parameter design 


AMIT MATHUR and KRISHNA R PATTIPATI 

Department of Electrical and Systems Engineering, University of Connecticut, 
Storrs, CT 06269, USA 

e-mail; [amit,krishna]@soLuconn.edu; amit@teamqsi.com 

Abstract. This paper reviews the evolution of off-line quality engineering 
methods with respect to one or more quality criteria, and presents some recent 
results. The fundamental premises that justify the use of robust product/process 
design are established with an illustrative example. The use of designed exper¬ 
iments to model quality criteria and their optimization is briefly reviewed. The 
fact that most design-for-quality problems inyolve multiple quality criteria mo¬ 
tivates the development of multiobjective optimization techniques for robust 
parameter design. Two situations are considered: one in which response sur¬ 
face models for the quality characteristics can be obtained using regression 
and considered over a continuous factor space, and one in which the prob¬ 
lem scenario and the experiment permit only discrete parameter settings for 
the design factors. In the former scenario, a multiobjective optimization tech¬ 
nique based on the reference-point method is presented; this technique also 
incorporates an inference mechanism to deal with uncertainty in the response 
surface models caused by finite, noisy data. In the discrete-factors scenario, an 
efficient method to reduce computational complexity for a class of models is 
presented. 

Keywords, Multiobjective optimization; off-line quality engineering; Taguchi 
methods; robust product/process design. 


1. Introduction 

The primary goal of a quality program in the manufacturing industry is to design and 
implement a production process geared towards satisfying the customer. It concerns itself 
with all aspects of the process, from conceptualization through design, to production and 
operations. Besides the need for an environment of total quality management built on a 
concurrent engineering foundation, the quality engineering tools needed to evaluate and 
optimize physical quality characteristics are paramount to the success of a quality program. 
Development of some of these tools is the focus of this article. 


Quality engineering deals with a methodology for maintaining (controlling) anc 
proving product quality. It addresses two main types of problems: on-line quality-coi 
and off-line design of products and processes for quality. The on-line techniques have 
erally been researched and applied under the rubric of statistical process control (S 
where the popular tools are various types of control charts (Montgomery 1991a). Th< 
line design-for-quality problem has aroused widespread interest in the West only sine 
eighties with the popularization of Taguchi’s methods of parameter design using desi 
experiments (also known as robust design, Phadke 1989). While SPC aims at mainta 
the quality of a production process at prescribed levels with the tacit assumption thi 
design has been optimized prior to production and that no further design improvem( 
necessary, off-line product development uses the tools of experimental design to o 
models for the various quality metrics of interest, which are then optimized with re 
to the design variables (factors). In both exercises, however, the ultimate objective 
ensure that the production system performs as close to the targeted levels and with as 
variability as possible. 

The prerequisites for a robust product/process-design exercise are the identific 
of suitable metrics or performance measures that can summarize quality, and the sp 
cation of targets for these measures. The objective then is to obtain values of the va 
product and process parameters that bring these measures to their targets. Manuft 
ing processes almost always require consideration of several individual quality cri 
whose physical relationships to the control parameters or design variables, as well as 
each other, are not always known. The problem, therefore, involves modelling all qi 
characteristics with respect to the control parameters (input variables) and simultar 
optimization of these quality criteria. Since the same settings of control parameters, in 
eral, cannot optimize all objectives, a trade-off or compromise is inevitable. The fc 
methodology for doing so, viz., multiobjective optimization (also termed multiple-cr 
decision-making or multiobjective programming), has been a subject of extensive res( 
in the area of resource management and planning for more than two decades (Zeleny ] 
Chankong & Haimes 1983; Steuer 1989). On the other hand, quality engineering has 
cemed itself mainly with the statistical and modelling aspects of the problem wit 
underlying expectation that the number of optimization variables would be small an( 
the concomitant optimization problem would be solvable via ad hoc methods. Wit 
need for increasing sophistication in product design and process improvement in m( 
manufacturing, it is important to incorporate a formal methodology in quality engine 
that integrates both statistical modelling and optimization into a single framework, 
is especially so as the number of quality criteria and the number of variables affectin 
process or product performance increase, rendering visual/graphical tools cumber: 
or ineffective. Multiobjective optimization methods can be applied to quality engine 
problems with only a few conceptual extensions. 

This paper reviews the evolution of off-line quality engineering methods for s 
and multiple objectives, and introduces some recent results. Since the focus is oi 
engineering tools (also known as the CAE (computer-aided engineering) tools, Boza 
1994), the issue of how to identify appropriate quality characteristics and the facte 
interest in a production system is ignored - those objectives are best accomplishe 
extensive brainstorming by the quality team (Bendell et al 1989; Boza et al 1994). I 
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remainder of this article, we first give a general overview of experimental design techniques 
and the notion of robust design attributed to Taguchi. The optimization of quality metrics 
with respect to continuous factors is addressed first and then the optimization problem 
with respect to discrete factors is considered. For the case of continuous design factors, 
we also examine the optimization problem for the single as well as the multiple objective 
cases in presence of uncertainty in models estimated from experimental data. We present 
a strategy for design improvements that account for the uncertainty in regression models. 
All concepts and methods are illustrated by examples. 

2. Overview of off-line design 

2.1 Quality metrics and experimental design 

The goal of Taguchi’s design-for-quality approach is to choose levels (or settings) of 
factors (or design parameters) that minimize the variability of a product’s function while 
achieving the target (Taguchi 1987; Phadke 1989; Nair 1992). This is in contrast with the 
traditional (and uneconomical) approach of ‘accept-reject,’ where products not satisfying 
the specifications within a tolerance are rejected. Taguchi’s philosophy is based on the 
premise that quality can be built into a product by a proper choice of design settings to 
lower the inherent variability, and thus to lower the cost of producing quality products. This 
cost first needs to be quantified by a quality loss function. A simple quality loss function 
for a quantitative quality characteristic (or response) y, which is targeted to have a value 
r, is the quadratic loss function 

L(y) = ko(y - Tf. 

Figure 1 plots the quadratic loss function in relation to the step loss function that corres¬ 
ponds to the tolerance-based approach; 8 is the tolerance limit about the target T, and Aq 
is the cost incurred in rejecting a product. Thus, the objective with respect to the quadratic 
loss function is to achieve product response as close to the target as possible by choosing 
appropriate parameter settings and without modifying the product technology itself. On the 
other hand, the step loss function attributes no loss of quality to a product whose response 
lies within ±5 of the target T, and hence no attempt is made to bring product response 
as close to the target as possible. Quality loss can be reduced in this approach only by 
improving technology - a more expensive alternative to parameter design. 

Let X = [xi, X 2 ,..., xa:] be a vector denoting the values of K (controllable) factors 
which affect the product response y, and whose levels can be fixed by design. Since the 





response }'(x) at the factor settings x varies randomly from sample to sample, resulting in 
a distribution of loss function values, an average (over the entire product population) of 
the loss function should be considered. For the quadratic loss function, the average loss 
function (assuming = 1) is the mean squared-error, 

MSE (X) = £[L(}>(x))] = aHx) + (n(x) - Tf, 

where E is the expectation operator, and /i,(x) and cr^(x) are, respectively, the mean and 
variance of the response y parameterized by x. For the mean squared-error type quality 
metric, quality is optimized by minimizing MSE(x) with respect to x taking values in a 
set S (factor space). 

Taguchi has proposed quality measures, called signal-to-noise ratios, for three different 
types of quality characteristics (e.g., Taguchi 1987; Phadke 1989). For a smaller-the-better 
type quality characteristic, where the target T to be achieved by the characteristic y is zero, 
it is defined as 

SWs(x) = -101og,o£[/(x)]. 

For a nominal-the-best type characteristic, it is defined as 

SNrix) = 101ogio[Ai^(x)/or2(x)], 
and for the larger-the-better type characteristic, it is defined as 

SiVi.(x) = -101ogioE[l/y^(x)]. 

Taguchi’s assertion is that maximization of these signal-to-noise ratios with respect to x 
would result in minimization of variability. For the nominal-the-best characteristics, he 
suggests identification of signal factors which affect the mean of y, but not its the 
signal factors can be used to bring the mean to the target following the maximization 
of SNt- Statisticians have devoted extensive effort to justify Taguchi’s measures and his 
two-step approach, but have found only very special scenarios in which their use can be 
validated (Leon et al 1987; Box 1988). Where the quadratic loss function is appropriate, 
the use of the mean squared-error (MSE) would be ideal, provided it can be empirically 
modelled using experiments. The fact that signal-to-noise ratios have been reported to have 
worked in several case studies demonstrates that there is room for significant improvement 
in many currently used manufacturing processes and that even ad hoc methods can realize 
some of that improvement. 

2.2 The scope for robust design 

To demonstrate that variability can indeed be reduced by parameter design, consider the 
following illustrative example (taken from Boza et al 1994). 

A circuit example: Consider the simple AC circuit shown in figure 2, where the power 
source £ is known to operate at a tightly toleranced frequency / of either 50 or 60 Hz, and 
an rms value of 100 VAC with a tolerance of ±10%. It is desired that the rms current I be 
as close to 10 amperes and with as little variability as possible. The design parameters are 


~\m 
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Figure 2. An AC circuit. 


[le nominal values of the resistance R and the inductance L, which are also toleranced at 
=10% about their nominal values. It is assumed that £, R and L are independent random 
ariables with means at their nominal values and the (±10%) tolerances at 3cr limits from 
ominal, e.g., the nominal value Rq of R is such that 

3o'/;q = 0.1/?o. 

’he frequency / is assumed to have zero variance about a nominal of either 50 or 
OHz. 

Using the Taylor series expansion about the nominal values of the circuit parameters, 
tie current I can be expressed as 


/ = /o± 




AR + ± 

0 9 ^ 


A/ H-(higher-order terms). 


^here the subscript zero on I and the partial derivatives denotes their evaluation at the 
iominal parameter values Sq, Rq, Lq and fo {£q = 100 VAC, /o = 50 or 60 Hz), and 


Ae = £~£o, AR = R-Ro, AL = L - Lq, A/= / -/q. 

Jsing the assumptions that the parameters are independent and that the terms of third and 


igher orders are negligible, taking the expectation of / in (1) gives the mean value of the 
utput current as 


fo, Ro, Lq) = fo ± 


2 ! \ d£^ 


% + 


1 / 


obtaining (2), we have also used the facts that (d^I/d£^) = 0 and aj ~ 0. For the 
iminal values being considered, the bias E(I) - Iq is negligible (less than 0.1%), and so 
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crjieo, /o, « 0 , Lo) [^]yi + (3) 

With the choice of 8o and /o not in the designer’s control, the mean squared-error 

MSE/(i?o, Lq) = crjiRo, Lq) + (/x/(/?o. - Ti)^, 

(where Tj = lOA is the target for /) can be minimized with respect to the nominal values 
Rq and Lq of the series resistance and inductance to obtain a robust circuit design. Using 
the functional relationship between I and the circuit parameters given in figure 2, the 
following partial derivatives can be derived: 


dl 

I 

3^7 

A. 



ds 


~ 




3/ 

I^R 

3^7 

73 

(?>R^I^ 

,v 

dR 

~ £2 ’ 

3i?2 “ 

£2 ' 


/’ 

3/ 


3^7 


. / 


3L 

= -{271 fY-^\ 

3L2 “ 

(271 fr^ i 

£2 



A 


The contours of the mean squared-error versus the nominal values of R and L are shown 
in figure 3 for both nominal frequencies. It can be seen that the surface has ridge-like 
behaviour, and, hence, for large ranges of R and L values, the mean squared-error is very 
close to its minimum. 

Boza et al (1994) assumed an initial design setting of {Rq, Lq) = (5f2,0.02H), and ob¬ 
tained a final design of {Rq, Lq) = (8^2,0.017H). Their objective, however, was only the 
minimization of the variability aj which was a sufficient goal as the customer-specified 
tolerance of 2.5 amperes on the targeted I was not violated. However, to achieve good 
robustness at both nominal frequencies, figure 4 suggests a design near {Rq, Lq) = 
(9.43fil, O.OIH), when at both frequencies an MSE = 0.21 is achieved. 

Table 1 gives a summary of the circuit’s performance at the initial design, the design 
obtained by Boza et al (1994), the designs at which MSE is optimum at either frequency. 



/Jo(fi) Rq(CI) 

Figure 3. Contour-plots of MSE/ versus Rq and Lq at 50 and 60 Hz. The marks our 
final design, the ‘ x ’ marks the design obtained by Boza et al (1994), and the ‘o’s mark the 
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Figure 4. MSB profiles versus Rq at two fixed values of Lq; the dashed line indicates the 
minimum achievable MSB under either frequency scenario. 


and at our suggested design. Table 1 also gives the values of the capability index Cpk at 
/ either frequency for each parameter design. Cpk is defined as (e.g., Montgomery 1991a), 

fUSL-M /x-LSLl 
Cp,=min{^—j. 

where USL and LSL are respectively the upper and lower specification limits for the 
response (for this example, USL/ = 12.5A and LSL/ = 7.5A). If the above expression 
returns a negative number, Cpk is taken to be zero. For Six-Sigma Quality (pioneered 
by Motorola Inc; Harry 1988), a Cpk value should be greater than or equal to 1.5. For 
this example, robust design has indeed achieved this goal. Further performance gains are 
possible only by technological improvement, e.g., by improving the component tolerances. 

The above example demonstrates that in order for the product to function equally well 
under two frequency scenarios, a compromise design should be used. While this compro- 
^ mise design was relatively easy to obtain using the contour plots of figure 3 and the MSE 
profiles in figure 4, a visual approach is not possible if the number of design factors or the 
number of objectives exceeds two. The problem of multiple objectives is examined in the 
next section. 

2.3 Experimental response modelling 

In the example considered in the previous sub-section, a physical relationship between 
the response of interest and the design factors was available. In a general manufacturing 


Table 1. Performance of AC circuit for different parameter designs. 


Ro i^) 

Lo(H) 


/ = 

50 Hz 



/ = 

60 Hz 


PI (A) 

0-/ (A) 

MSB; (A^) 

^pk 

PI (A) 

ai (A) 

MSB/ (A^) 

Cpk 

5 


12.458 

0.513 

6.302 

0.03 

11.058 


1.333 


7.1 

0.0225 

9.984 

0.408 

0.166 

2.03 

9.06 

0.371 

1.053 

1.39 

7 

0.019 

10.874 

0.446 

0.962 

1.22 

9.988 

0.408 

0.166 

2.03 

8 

0.017 

10.4 

0.435 

0.349 

1.61 

9.759 

0.401 

0.219 

1.88 

9.43 

0.01 

10.069 

0.452 

0.210 

1.79 

9.854 

0.436 

0.211 
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process, this would rarely be the case. Consequently, empirical models for the metrics 
are obtained. An experiment consists of a systematic variation of the factors and record¬ 
ing of the response(s) defining the quality metric(s) to enable reliable estimation of the 
models. For an experiment of N runs, the procedure involves obtaining (noisy) measure¬ 
ments y(x) of a response at N distinct values xi, X 2 ,..., xjv in the space S of the design 
factors. In Taguchi experiments, since the objective of interest is an aggregate measure, 
such as the signal-to-noise ratio of a product/process characteristic, which summarizes the 
variability of quality characteristic for a given combination of design-factor settings, an 
experiment involves variation of not only the design factors, but also a systematic vari¬ 
ation of noise factors that contribute to the variability in the product’s response. Noise 
factors are variables that are uncontrollable in the real environment but can be manipu¬ 
lated in the experimental set-up. Thus, a Taguchi experiment consists of (i) an inner array 
which lists the design factor-level combinations, and (ii) an outer array that lists the noise 
factor-level combinations for each design factor-level combination. If m is the number of 
combinations of noise factors chosen for an outer array, an experiment yields m replicate 
measurements for each of the N runs. What the best placement of the points xi, X 2 ,..., Xiv 
should be depends on the assumed model-type for 77 (x) (e.g., linear, quadratic etc.), and 
on practical considerations. The problem of design of experiments has been extensively 
researched during this century by applied statisticians, and there exists a rich body of lit¬ 
erature documenting numerous design types and approaches (see, for example, Box et al 
1978; Montgomery 1991b, and the bibliographies therein). In this article, we shall sidestep 
this issue by assuming that, for a given problem, an appropriate design has been identified. 
However, for the purposes of this paper, we mention one useful and frequently used design- 
type: thQ fractional-factorial design. Most of the orthogonal-array designs compiled and 
recommended by Taguchi are of the fractional-factorial type. For example, consider an 
experiment involving the study of the effects of three factors xi,X 2 and X 3 on the response 
y. If each factor is constrained to take only two fixed levels, denoted by 1 and 2, during 
the experiment, the maximum number of distinct runs N is 8 , so that all combinations of 
the levels of each factor are tested. This arrangement, shown in the array Lg in figure 5, is 
a two-level ^//-/< 2 c?ona/ design. If only four of the eight possible combinations are run, 



. . ■ . •'’V 





as in the array L 4 in figure 5, the resulting design is of the fractional-factorial, type. A 
useful feature of fractional-factorial designs is their orthogonality, i.e., the analysis of data 
with respect to each factor can be done using only the column(s) corresponding to that 
factor. Orthogonal arrays (which include some fractional-factorial designs) also possess 
the following key features: 

• each level of a factor appears in an equal number of runs (factor-level combinations); this 
is called the balancing property - it ensures unbiased estimation of all column effects; 

• the effects measured by any two columns are mutually orthogonal (uncorrelated); 

• the effect measured by a column may be aliased (confounded) with the combined effect 
(interaction) of two or more columns. 

Orthogonal arrays are normally used in Taguchi’s off-line design procedure for the inner 
array which determines the combinations of the design factor-levels. In addition, their use 
has also been recommended in the outer array which determines the combinations of noise 
factor-levels, even though the statistical implications of doing are debatable (Montgomery 
1991a). 

2.4 Data analysis for orthogonal-array experiments 

In this sub-section, we briefly review the common data analysis techniques for experimental 
data: analysis of means, analysis of variances (ANOVA), and multiple linear regression. 
These techniques are used to estimate the functional relationship between the factors and 
a response, and to assess the statistical significance of the estimated relationship. While 
ANOVA and regression are the common tools of choice, the analysis of means is often 
convenient for measuring factor effects when an orthogonal-array experiment is used. 
The following results are given with respect to analysis of experimental data using an 
orthogonal array. 

If a trial (or run) corresponds to the level combination (xi, X 2 ,..-^/c) of the factors 
(denoted by x), then we shall assume that a response y can be expressed as 

y(x) = r7(x) -f-e, 

where r}{\) is an unknown deterministic function — the mean response corresponding to 
this factor-level combination, and e is a zero-mean random noise with variance Here, 
the response y is used to denote measurements of either the quality characteristic or any 
suitable loss function derived from it, e.g., the MSE or a signal-to-noise ratio. In either 
case, we assume that e is independent with constant variance over all experimental runs. 
The task of experimental modelling is eventually to estimate the functional relationship 
?](x). 

2.4a Analysis of means and variances: Consider a balanced experiment in which each 
factor-level combination appears in an equal number of trials, and, consequently, each 
of the Lk levels of factor Xk appears in an equal number, nk, of trials, i.e., nkLk ~ N. 
The number of trials N need not be equal to the total number of possible factor-level 
combinations, but in a factorial design, for example, N must divide fllLi We assume 


that we take m replicates per trial and, hence, have Nm observations of the response 
variable: yij,i = \,2, j = 1, 2 ,..., m (note that if y denotes an aggregate or 

summary measure such as an MSE or a signal-to-noise ratio, we would have m — I since ^ 
replicated measurements of the quality characteristic would already have been used to 
compute y). Now, the following sample averages and sums of squares may be defined: 


N m 




Nm ^ 




i=lj=l 


1 m 

^ ;=i 


1 ,..., 


I m 

yw=— E E^-7. ^ = 


.,K; / = l,2,...,L;t, 


(4) 

(5) 

( 6 ) 


N m 

Lk 

Sk=mnkY,{ykj-yf, k = l,2,...,K, (8 ); 

/=i 

Nm 

= -yif, (9) 

/=ly=l 


K 

Sr = S-Y,Sk^Se + SLOF, 

k=l 


( 10 ) 


where y is the overall mean, y; is the average response in the ith trial, y^ / is the average 
response due to factor k taking level l,{i : 4 = /} is the set of indices denoting trial numbers 
in which factor k takes level 5” is the overall sum of squares (corrected for the mean), 

Sk is the sum of squares contributed by the variation due to changes in the levels of factor ^ 
k, and Se is the sum of squares due to pure random error (it accounts for the within-run 
variations). 5r, known as the residual sum of squares, accounts for the contribution SioF^o 
the variability in y due to lack of fit (unaccounted-for effects, e.g., interactions), and due to 
pure random error (Sg). The first three of the above set of equations constitute the analysis : 
of means (ANOM) which enables measurement of effects of factors. The remaining four 
equations are used for AN OVA, which enables the identification of significant factor effects 
that should be included in the model for r}. In (6), the mean responses have been computed 
only for each of the factors taking a given level (main effects) assuming a linear (additive) 
model for t]. But a factorial design and some orthogonal arrays may permit estimation 
of not just the main effects but also interaction effects that measure the combined effect 
of two or more factors on the response. A two-level factorial design, for example, can t 
measure as many effects (main or interaction) as the number of columns in the array. The ^ 
computation of these effects is similar to that given in (6) since exclusive (orthogonal) 
columns in the array can be assigned to measure these effects provided N is large enough 
to accommodate their estimation (see Box et al 1978; Montgomery 1991b). In general, if 
p (main and interaction) effects can be measured by an orthogonal array (p < V — 1), a 



( 11 ) 


prediction equation for rj can be obtained as, 

vih,h, •. •, /a:) = j - y)> 

1=1 

where / 2 , • •.,/a:) is the estimated response at the factor-level combination 
(/], / 2 ,..., /a:) of the AT factors; the levels of the interaction effects are determined from 
those of the corresponding interacting factors. The goodness of fit of the model can be 
measured by the value defined as 

The maximum possible B?' is {S — Se)/S; if the number of replications m = 1, the 
maximum possible is I. 

2.4b Multiple linear regression: An ANOVA is usually accompanied by least-squares 
regression to estimate the response-surface model for rj. ANOVA gives the significant 
effects that can be included in a polynomial model for r]. That is, r}(x) is assumed to have 
the functional form 

r}(x) = ^'z(x), 

where 

Z(X) = [1 ... Xk ... xl X\X2 ...]' 

is a p >< 1 vector containing 1 as the first element, and powers and cross-products of 
xi,X 2 ,..., xic, that are found significant from ANOVA, as the remaining p — 1 elements. 
Assuming m = 1, if zfx,)', i = N, corresponding to the N runs, constitute the 

rows of the V X p matrix X, and if y = [yi,..., denotes the vector of N independent 
measurements of the response, least-squares regression gives the estimates (Anderson 
1984), 

e = (rx)-‘x'y, (12) 

i)(x)=:0z(x), (13) 

for the model coefficients and the response surface. 


2.4c M responses : Multiple linear regression can be extended in a straightforward 
manner to the case of simultaneous modelling of M responses using design matrix X. 
Now M vectors of coefficients, represented as the M columns of the p x M matrix © in 
the model, 

y(x) = 0'z(x) £, 

where 0 = [0i • • • (9 a/] are to be estimated, the measurements y and the noise e are M- 
vectors, and e is assumed to be zero-mean with M x M covariance matrix S. Now, based 
on the measurement data 
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F = X© + E, 


where Y and E are x M, and X is A" x p, the least-squares estimates are 

© = (X'X)"^X'F, 

? 7 (x) = ©'z(x). 

The least-squares estimates satisfy the following statistics (Anderson 1984) 

E (6j) = 6j (unbiased) 

Cov(0j,ej) = a,j(X'X)-‘ 

E(fi(x)) — 0'z(x) = ri{x) (unbiased) 

Cov(t)(x)) =z(x)'(X'X)~^z(x)E. 


(7ij is the ijth element of E {ajj = ap. Also, 


t = [1/(A - pW'Um - X(X'X)"^X']F, 


(14) 

(15) 


(16) 

(17) 


(18) 


where 7^/ is the A x A identity matrix, and E is the unbiased estimator of E provided Y 
is of rank M (M < N — p). 

Under normality assumptions, the estimates are distributed according to the following 
two independent (multivariate) distributions 


•xf X' 

{ 6.0 


1 C72 • • • Om)' ~ Ar((6>'i0i... O'j^)', E . (X'X)-'), 
{N - p)t^W{Y:,N ~ p). 


(19) 

( 20 ) 


where E • (X'X)~Ms a scalar product of the two matrices {an mp xmp matrix with each 
element of E multiplied by the matrix (X'X)""^). In (20), 1F(E, A - /?) is a Wishart dis¬ 
tribution with (A — p) degrees of freedom, parameterized by the matrix E (see Anderson 
1984). The diagonal elements of a 1F(E, A — p)-distributed matrix are chi-squared dis¬ 
tributed random variables with N — p degrees of freedom. 

In the rest of this article, the use of prediction models, obtained either from least-squares 
regression or analysis of means, for optimizing quality criteria will be considered. 


3. Multiobjective optimization 

Consider now the robust design of a product/process with respect to M quality metrics. 
Ignoring for now their estimation using experimentation, suppose the response functions 
r){x) = [771 (x) 772 (x) • • • 7 /a/(x)]' are perfectly known. The design problem then involves 
obtaining the process variable settings x* e 3 that simultaneously result in the most desir¬ 
able compromise of the m responses. As against single-objective optimization, in which 
the optimum response is a unique value (maximum or minimum response), ‘optimiza¬ 
tion’ with respect to multiple objectives refers to the attainment of any one solution in 
a set termed the nondominated solution set. A solution is said to be nondominated (or 
Pareto-optimal, or noninferior) if no other solution is at least as good as this solution 
with respect to every objective and better than this solution with respect to at least one 



lyyiuru^uiiun [jiuuicrri^ in ruuu^i ijuturncicr unAi^n 


i / 



Figure 6. The concept of nondominated solutions (in a two-dimensional objective space). 

objective (see Chankong & Haimes 1983). One can use the partial ordering available in 
the Af-dimensional objective space 11^ to identify the nondominated set by eliminating 
each of those feasible points that is definitely worse than at least one other feasible point 
(with respect to all M objectives). The concept of nondominated points is illustrated in 
figure 6 for the case of maximization of two objectives {M = 2), where the set of attainable 
(feasible) objective-pairs is shown as a closed set Seven points in this set: a, b, c, d, 
e. Pi and P 2 , are also marked for the purpose of illustration. Point a dominates c and e 
since a is better than c as well as e with respect to both objectives, rji and ^2- Similarly, b 
dominates d and e. Points a, b, pi, P 2 , and all other points on that segment of the boundary 
of the feasible region Q between Pi and P 2 , are nondominated, while c, d, e, and all other 
points inside the feasible region are dominated. 

The concept of domination or partial ordering in an M-dimensional objectives space 
can be mathematically defined using the concept of a positive cone D (any closed, 
convex, proper cone): 

qi, qz € 7^^, qi < q 2 qz - qi e D. 

In figure 7, since the point qz — qi is in D, q 2 dominates qi , and since both individually lie 
in D as well, both dominate the zero-vector 0. A nondominated objective (or D-maximal 
or Pareto-optimal objective) q in is defined by 

q € ^2 is D-maximal ^ n (q -{- D) = 0. 

The selection of any particular nondominated solution from the set of all nondominated 
solutions in must be qualified by the preferences of the decision-maker (process- or 
quality-engineer), for the choice of any one nondominated point over the others implies 
a trade-off of one or more objectives for a gain in another objective. There exist several 
different, but closely related, methods of incorporating a decision-maker’s preferences to 
search for the final solution (see, for example, Zeleny 1982, Steuer 1989). In one such 
method, called the method of reference points (Wierzbicki 1980), a reference point (a 
vector of desired objectives) is specified by the decision maker. The method maximizes 
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Q2 

\D 



q2 -qi 


91 


Figure 7. Positive cone D in a' two- 
dimensional (M = 2) objectives space R?. 


a scalarizing junction called the achievement function that guarantees a nondominated 
solution. This is introduced in the next section. 

4. Multiobjective optimization of response surfaces 

Most designed experiments enable estimation of models for the quality metrics over a con¬ 
tinuous region S using least-squares regression. In this case, response surface models (Box 
& Draper 1987) are available for optimization, and nonlinear programming methods can 
be used. For multiobjective optimization, the reference point method can be applied by 
maximizing a scalarizing function (called the achievement function) of the M response 
models over the feasible space. One possible scalarizing function is (Lewandowski et al 
1989) 

M 

q) = . CLiim - qi) -f ^T^CLjiqj - qj), (21) 

defined for a general point q in the objectives space Q C , and for a reference-point 
q. f and a/, / = 1 ,..., M are positive constants, whose values are to be fixed by the 
decision-maker. 

Figure 8 illustrates the concept of the achievement function in (21) in a two-objectives 
space, where the two objectives qi and q 2 are to be maximized subject to the constraint that 
they lie in the set ^2 of attainable objective-pairs. The reference point q that is chosen in 
this figure is unattainable; in general, it could lie inside S2 and the reference-point method 
would still return a nondominated solution. The ‘ideal’ point (Khuri & Conlon 1981), 
whose coordinates are the maxima of the individual objectives, is also shown, denoted 
here by q/. (In an example that we have considered later in this section, the ideal point is 
chosen as the reference point.) The contours of the achievement function in (21) are shown 
for two values; ^ = 0, and s = S 2 (J2 < 0)- The shape of these contours is also the shape 
of the chosen domination cone Dp, e.g., all points right of the contour corresponding to 
5 = 0 dominate the reference point q. The boundary Q of the attainable set Q, indicated 
by the tliick line comprises the nondominated set, i.e., no other point in ^2 dominates a 
point in this set with respect to both objectives. The point q* is the multiobjective optimum 
solution obtained by the maximization of the chosen achievement function over q € ^2 (the 
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Q: Set of attainable objective-pairs {qi, q^) 


Figure 8. Concepts of reference point and achievement function for a two-objective max¬ 
imization case. 

5-contour is tangent to the set Q sdq*). is related to the angle 9 as shown in figure 8. It 
determines the order that a decision-maker wishes to assign to points that do not dominate 
one another. The parameters ai and 0:2 determine the slope of the dividing plane between 
the two piecewise-linear regions for 5 . 

The optimization problem can now be expressed as 

max5(q, q) (22) 

or, using q = ? 7 (x), 

maxs(77(x),q). (23) 

xeS 

When a unique, deterministic mapping 77 : E —^ is not known, and response surface 

models have to be used, one may instead maximize 

M 

sir){x),q)= min (^f (x) - ^/) + ^ (rjj (x) -qj). (24) 

The issue that the objectives are no longer deterministic, but involve modelling uncertainty, 
can be handled as follows (see Mathur & Pattipati 1995). 

The maximum position x* of 5 ( 77 (x), q) is declared as the ‘multiobjective optimum’ 
but, in addition, a region similar to a confidence region is associated with it. This region is 
obtained via Monte Carlo simulations about the estimated models using the least-squares 
estimates and their distributions. Since, from (19), 
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(»'l - (®1 ^2--- s'u)' ~ S 

the model coefficients 0 \, 62 , ...,6 m may be generated using the normal distribution, 

jV((0i02..-0'„)',i:-(rx)-’), (2; 

which is parameterized by E instead of the (unknown) E. Also, since (N — p)Y^ is itse 
distributed as a Wishart distribution W(Yi, N — p) (see, for example, Anderson 1984), or 
might generate matrices using W (E, N—p) and use these to parameterize the above norm; 
distribution. The multiobjective optima of the models so generated can then be obtaine 
for all the runs and plotted on a scatter plot to visualize the region of distribution of tt 
optima. For cases involving more than two or three input variables, where visualization c 
such a region is not easy, a Monte Carlo significance test procedure (Barnard 1963) ma 
be used to compute a boundary for this region. 


Monte Carlo significance test: Let x/, i = 1, 2,..., be the positions of the optim 
obtained from Ns independent Monte Carlo simulations. Let these points be realizatior 
of independent random vectors with distribution p(x) which is unknown. Let f(x) be 
test criterion for the hypothesis that the true optimum is distributed according to p(x). W 
denote these statistics hyt\,t 2 ,^--,tNs the Ns runs. Now, for a level of significance i 
(0 < a < 1), the region in which the true optimum lies can be obtained as 


{x ; f(x) < f(LiV,(i-a)j)}, (2f 

where is the statistic of rank j obtained after sorting {t, } in ascending order, an 
lNs{l - q;)J is the largest integer less than or equal to Nsil - a). We shall refer to th 
region defined by the set in (26) as the ‘significance region.’ 

The choice of a statistic t depends on the distribution /?(.) of the x,-, which is unknowr 
The ideal (but intractable) approach would be to obtain a nonparametric estimate of th 
distribution function based on the Ns measurements x,-. However, for some applications, 
is possible to obtain reasonably good approximate regions based on a statistic that utilize 
the sample estimates of the first few higher-order statistics: mean, covariance, (multivariate 
skewness, kurtosis, and so on. Let x be the sample mean, and S be the sample covarianc 

matrix. In addition, defining u/ = S~^(xi - x), where u,- = (w/i, m/ 2 , . •., UikY, 1« 
nmn, l,m,n = 1,2,..., ^, be the lmn\h element of the sample third-order cumular 
(skewness) tensor; 


^Imn 


1 


Ns 

tiiiUiffiUifi, 

/=! 


and let Kimnp, l,m,n, p — 1,2,..., K,betht Imnpth element of the sample fourth-orde 
cumulant (kurtosis) tensor; 

Ns 


1 
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Table 2. Experimental design and measured response data (example). 


1 , 

Run 

Design (original) 

Design (coded) 


Responses 


(mM) 

X 2 (mM) 

X\ 

X 2 


y2 

y3 

V 4 (mm) 

!5i 

1 

8.0 

6.5 

-1 

-1 

2.48 

0.55 

1.95 

0.22 


2 

34.0 

6.5 

1 

-1 

0.91 

0.52 

1.37 

0.67 

elf 

3 

8.0 

25.9 

-1 

1 

0.71 

0.67 

1.74 

0.57 


4 

34.0 

25.9 

1 

1 

0.41 

0.36 

1.20 

0.69 

•ns 

5 

2.6 

16.2 

-1.414 

0 

2.28 

0.59 

1.75 

0.33 

lal 

6 

39.4 

16.2 

1.414 

0 

0.35 

0.31 

1.13 

0.67 


7 

21.0 

2.5 

0 

-1.414 

2.14 

0.54 

1.68 

0.42 

he 

8 

21.0 

29.9 

0 

1.414 

0.78 

0.51 

1.51 

0.57 

9 

21.0 

16.2 

0 

0 

1.50 

0.66 

1.80 

0.44 

of 

10 

21.0 

16.2 

0 

0 

1.66 

0.66 

1.79 


av 

11 

21.0 

16.2 

0 

0 

1.48 

0.66 

1.79 



12 

21.0 

16.2 

0 

0 

1.41 

0.66 

1.77 

0.43 


13 

21.0 

16.2 

0 

0 

1.58 

0.66 

1.73 

0.47 


mM = millimolar 


m . 

m where S[j = 1 , if / = 7 ; otherwise, it is zero. Higher-order cumulants can similarly be 

? a computed. If the skewness, kurtosis, and higher-order cumulants are negligible, it may 

iVe suffice to use the statistic 

f Cf 

r 1 (x) = (x - x)' 5” ^ (x - x) = u'u. (27) 


:6i 

ml 

he 

.0. 

he \ 

.It 

KS 

lei 

rce 

let 


tkr 




In general, a statistic based on the likelihood function involving the higher-order terms in 
an Edgeworth expansion (see, for example, Kolassa 1994) could be used: 


?2(X) = -0(U) 



K 

^Imn Hlmn (u) 


1 ^ 

^Imnp Himnp i^) “b 


l,m,n,p=\ 


(28) 


where 0 (u) is the K -variate standard Gaussian density function (for mean 0 and covariance 
matrix I). Himn i'o) and Himnp{u) are, respectively, the third-order and fourth-order K- 
dimensional Hermite polynomials defined as 

(-1)3 33 ^^ 

' </>(u) duidundun 

~ tliUman i^lman “b ^mn^l ~b ^nlO-rn) » 


and 


Hlmnpi^) — 


aU(«) 


i-if 

(f)(yi) 3uidUffi3Ufibup 

UiUfnUfiUp i.^npO'infn "b • • • “b ^mpaiUn) 
“b (.^Im^np "b ^In^mp "b ^Ip^mri)- 
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Figure 9. Regions corresponding to non- 
dominated solutions and dominated solu¬ 
tions with respect to all four responses 
based on the estimated models (example). 


The higher-order polynomials have similar extensions. 

To summarize the above, the procedure involves the following steps: 

(1) Using the experimental data for the responses, obtain regression models and estimates 
of the covariance matrices of the model coefficients using (15) and (18); 

(2) Compute the achievement function (24) over the region S using the predicted responses 
and the reference point input by the user; obtain the multiobjective optimum settings 
by maximizing the achievement function over the region S; 

(3) Obtain, several independent sets of models by Monte Carlo simulations using the 
least-squares estimates of the statistics in the generating distribution (25); for each 
realization, repeat step 2 to obtain a sample of points {x/}; 

(4) Apply the statistic in (28) (or (27) if adequate) to the sample optima {Xj} of step 3 to 
obtain a region of the form (26). 

The feasible region S in the factor space is recommended to be (two or three times) larger 

than the experimental design region in order to ensure that the sample points obtained 


Table 3. The second degree regression models (example). 


Model term 


Regression coefficients 


yi 

y2 

3’3 

}’4 

int. 

1.526 

0.660 

1.776 



-0.575 

-0.092 

-0.250 


•^2 

-0.524 

-0.010 

-0.078 


A 

-0.171 

-0.096 

-0.156 



-0.098 

-0.058 • 

-0.079 


X\X2 

0.318 

-0.070 

0.010 



0.95 

0.98 

0.98 

0.95 
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Figure 10. Distribution of the duat-objective optimum (for ^2 and y^) for the example; the 
‘+’s show the positions of the optima of the individual (estimated) responses. The region 
of significance is obtained using the statistic t\ in (a) and t 2 (with the first four orders of 
moments) in (b). 


via simulations lie inside S (see Mathur & Pattipati 1995); for if a large number of such 
points lie on the boundary of S, the estimation of the moments (and cumulants) used in 
the statistic t could incur large errors. In the final analysis, all inferences can be confined 
to the experimental region. The region of significance should be interpreted as follows: 
if it includes the design centre (current settings), there is not enough evidence to suggest 
a change in the factor settings; if it does not, but is large compared to the size of the 
experimental region, or if it lies outside the experimental region, further experimentation 
is needed. 


4.1 Example 

The following example was originally discussed by Khuri & Conlon (1981) (also see 
Mathur & Pattipati 1996). In this case study of a dialyzed whey-protein-concentrate (WPC) 
gel system, the effects of two inputs: concentrations of cysteine (xi) and calcium chloride 
(X 2 ), on four responses measuring the textural and water-holding characteristics of the 
WPC gel were studied. The four texture characteristics are: hardness (yi), cohesiveness 
(yi)^ springiness (ys), and compressible water (y 4 ). The goal of the problem is to max¬ 
imize the measures of all these characteristics. A 13-point rotatable central-composite- 
design experiment (Montgomery 1991b) with five centre points (for uniform precision) 
was conducted to vary the input variables. The experiment design and the measured values 
of the two responses are recorded in table 2, reproduced from Khuri & Conlon (1981). 




































(a) 



(b) 



Figure 11. (Example, two-response case) 
(a). Attainable objective pairs (of yi and 
>> 3 ) based on estimated models. The ‘-h’ 
marks the chosen reference point, (b). Scat¬ 
ter of the dual-objective optima obtained 
from 1000 Monte Carlo simulations of the 
models. 


Only y 2 and y^ considered: Considering all four responses, it turns out that the current 
operating point x = (0,0) is efficient (nondominated). That is, no further improvement in 
any response is possible without trade-off in at least oiie other response. In fact, a large area 
of the region <2 maps to the nondominated set in the objective space (see figure 9; 

this efficient set is based on the predicted responses). So, for illustrative purposes, we first 
consider only two of the four responses: 72 ys. The choice of ignoring responses 
yi and 3^4 in particular was made because yi and y 4 have no global maxima (based on 
their least-squares models); their stationary points are saddle points which lie in opposite 
quadrants of the variable space, far from the design region. 

For this case, the efficient set (based on the response models) in the factor space is a curve 
connecting the two individual optima. We maximize the achievement function in (24) with 
aj chosen as l/cfy, = 0.01, and the estimated ideal point ^ = [0.681.9]' as the reference 
point. While this choice of aj is arbitrary, and while other choices are possible, it normalizes 
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Figure 12. Distribution of the multiob¬ 
jective optimum (for all four responses) for 
the example; the ‘-f’s show the positions 
of the optima of the individual (estimated) 
responses. The region of significance is 
obtained using the statistic t 2 with up to 
sixth-order moments. 


the achievement function against differences in the noise powers in the two responses, and 
also happens approximately to normalize for the different absolute scales. The estimates 
of 0-2 and are 0.0005 and 0.0025 respectively. For maximization of a Simplex search 
method was used (Nelder & Mead 1965). The model optimum and its ‘significance’ regions 
obtained using the statistics of (27) and (28) (using up to fourth-order terms) are shown 
in figure 10. The multiobjective optimum is obtained at (x*, xj) = (—0.61, —0.16), and 
the predicted response vector at this point is {y 2 , ys) = (0.67,1.88). At the individual 
model optima, the estimated response vectors are (y2, ^ 3 ) = (0.68,1.84) and (y2. ys) = 
(0.63,1.9). The spread of the optimum, obtained from 1000 Monte Carlo simulations 
of the models, indicates the region in which the true optimum could lie assuming the 
% quadratic model assumptions to be correct. The region enclosed by the solid line is that 
obtained from the Monte Carlo significance test for level 0.01; that is, the ten most extreme 
values were excluded from the region. For this example, the statistic in (27) seems to be 
adequate. The statistic t 2 is a little better in following the shape of the scatter of optima as 
it accounts for the sample skewness and kurtosis. In any case, these regions indicate that 
both responses y 2 and ys can be improved by lowering the variable xi to a (scaled) value 
of approximately —0.6, while not altering the variable X 2 from its current value. Since we 
are considering only two responses at this point, it is possible to view the attainable set Q 
in the objectives space, shown in figure 11a. The scatter of the multiobjective optima is 
shown in figure 11 b. 


I All four responses considered: We now apply our method to the multiobjective optimiza¬ 

tion of all four responses in this example. We again choose the reference point to be the 
ideal point [2.69,0.68,1.9,0.72]'. The ay’s are again chosen to be l/dy’s, and = 0.01 
((T^ = 0.0399 and — 0.0017). The multiobjective optimum for these parameters was 
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Figure 13. A two-dimensional discrete 
objectives set. 


from the Monte Carlo significance test (for level 0.01) using the statistic t 2 with up to 
sixth-order terms is also shown by the solid-line curves. The scatter reveals the extent 
to which uncertmnty in the four models can affect the position of the (multiobjective) 
optimum factor settings: the best factor settings could lie anywhere in a band that stretches 
across the experimental design region from the vicinity of (—1, 0.5) to the vicinity of 
(0, —1.4). While maximizing the achievement function in each Monte Carlo run, we have 
deliberately not constrained the maximum to lie inside the circle xf + <2 (see the 

remarks following the introduction of our method). If the scatter of the global maximum 
of the achievement function lies mostly outside the experimental region, there is not much 
sense in fitting a region of significance„as a better interpretation might be to conduct more 
experiments in a new region outside the current design region. For fitting a region about the 
scatter of points in this example, the shape of the scatter demands that higher-order terms 
beyond the third- and fourth-order terms be included in the Edgeworth-series statistic t 2 . 
Figure 12 shows the region obtained by including up to sixth-order terms. Since the current 
operating point (the design centre) also lies inside the significance region, there is not a 
strong case for changing the current settings for multiobjective improvement of all four 
responses. 


5. Multiobjective optimization w.r.t. discrete factors 

When quantitative factors are constrained to take only a finite number of levels, or when 
the factors are qualitative (or categorical), the multiobjective optimization problem 
reduces to a combinatorial optimization problem. Here, the assumption is that the ytth factor 
(^ = 1,..., a:) can only take one of Lk levels: xt G {4,i,..., that is, the feasible 
set S is finite (with cardinality 0^=1 and so is the set Q of feasible objectives. Using 
experimentation, a prediction model is obtained at all factor-level combinations. The solu¬ 
tion of the discrete (multiobjective) optimization problem (DMOP) requires an exhaustive 
search in a space of cardinality n|Li Lk (which we will refer to as the size of the DMOP). 
However, under the frequently occurring conditions of separability of factor effects, the 
problem can be decomposed into problems involving much smaller search spaces. 

Let the mean rjj of the jth response (j — 1,..., M) be related to the K factors by a 
relationship of the form 




(X) = g\’\xi) + g^\x2) +-h gi-'\xK)- 

Then, it is clear that the set of x^’s, A: = 1,..., K, which optimize the respective gjj'^ 
would also optimize rjj. This approach can be extended in a straightforward way to the 
multiobjective optimization of all M responses, as shown by the following theorem (Song 
etal 1995). 

Theorem 1. Let the sets C , k = 1,..., AT and the set C, defined as 
C = {r7|77 = gi + g 2 +-h g/ir, g/t e 

have the same domination cone. Then, if gk is dominated in for at least one k,k — 
I, K, the point ^ = gi + > • • + gA' is dominated in C. 

Proof 1. Assume that D is the domination cone, and r) = gi H-hgA: is a nondominated 

point in C. Then 

(?7 + D) n C = 0. (29) 

Suppose g;t is dominated for some k. Then there exists a G such that gk s gyt + A), 
Since r? = gi + • • • + gA:, we have 

= gl + • * • + g/: +-h gA" € g/ H-+ g/t + • • • + gA: + A). (30) 

Thus ff e C and fj e ff + D, which contradicts the assumption that fj is nondominated. 

The applicability of the above theorem to the discrete factors case can greatly reduce 
the complexity of the optimization problem, for it enables decomposition of the multiob¬ 
jective optmization problem into two stages; the first stage consists of K multiobjective 
optimization problems of sizes Lj, L 2 ,.. Lk- (multiobjective) optimize g^ with respect 
^ to xic. The nondominated sets B^ obtained from the first stage can be used to construct the 
search space C for the second stage whose cardinality can be much smaller than fl/tLi Lk. 
For example, if an experiment involves six three-level factors, the search space contains 
3^ = 729 combinations. If the low-level (first-stage) optimization problems (each of size 
3) reduce the size of the nondominated sets even by one for each factor, the size of the 
search space for the second stage would be reduced to 2^ = 64. Thus, the cost of the high- 
level optimization would be reduced ten-fold. If the effects of some factors do interact, 
the decomposition can still be done with respect to those factors that do not interact, and 
some reduction in complexity attained. 

Search for the nondominated set : An explicit enumeration technique called the tech- 
1 nique of dominate approximations (TDA) (Majchrzak 1989) can be used to obtain the 
nondominated set from a discrete space Q. The technique can be illustrated using fig¬ 
ure 13. In iteration one, the method maximizes the performance measure rji to obtain the 
point d, and then rejects all points dominated by d to generate a dominated approximation 
Q\ — {a, b, c, d, X, y, z); in the second iteration, the method maximizes component r]2 
(excluding d) to obtain a, and then rejects all points dominated by a to generate another 
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Table 4. Orthogonal design (Lig) and summarized data. 






Factors 





Responses 


1 

2 

3 

4 

5 • 

6 

7 

L 

W 

F 

S 

1 

1 

1 

1 

1 

1 

1 

1 

-8.6 

-12.2 

-2.2 

1.0 

2 

1 

1 

2 

2 

2 

2 

2 

-10.3 

-9.2 

-5.7 

1.0 

3 

1 

1 

3 

3 

3 

3 

3 

-15.9 

-14.0 

-8.5 

1.33 

4 

1 

2 

1 

1 

2 

2 

3 

-2.2 

~12 

-1.1 

1.0 

5 

1 

2 

2 

2 

3 

3 

1 

-13.0 

-13.6 

-7.4 

1.0 

6 

1 

2 

3 

3 

1 

1 

2 

-10.2 

-17.5 

-14.7 

1.33 

7 

1 

3 

1 

2 

1 

3 

2 

-13.3 

-16.3 

-7.7 

1.5 

8 

1 

3 

2 

3 

2 

1 

3 

-14.3 

-16.3 

-10.4 

1.0 

9 

1 

3 

3 

1 

3 

2 

1 

-16.9 

-18.2 

-14.2 

2.33 

10 

2 

1 

1 

3 

3 

2 

2 

-12.0 

-11.7 

-2.5 

1.0 

11 

2 

1 

2 

1 

1 

3 

3 

-15.8 

-18.2 

-11.1 

1.0 

12 

2 

1 

3 

2 

2 

1 

1 

-17.1 

-8.4 

-12.2 

1.67 

13 

2 

2 

1 

2 

3 

1 

3 

-18.1 

-13.7 

-9.7 

3.0 

14 

2 

2 

2 

3 

1 

2 

1 

-5.6 

-15.1 

-7.8 

1.33 

15 

2 

2 

3 

1 

2 

3 

2 

-16.1 

-15.1 

-11.8 

3.0 

16 

2 

3 

1 

3 

2 

3 

1 

-11.4 

-16.7 

-12.5 

1.83 

17 

2 

3 

2 

1 

3 

1 

2 

-12.1 

-16.3 

-11.0 

1.33 

18 

2 

3 

3 

2 

1 

2 

3 

-6.3 

-20.9 

-12.5 

2.0 


Factors 1; Injection pressure; 2: Injection speed; 3: Mould temperature; 4: Melt temperature; 5: Holding 
pressure; 6: Cool time; 7; Hold time 

Responses I: Length SNR; W: Width SNR; F: Flatness SNR; S: Surface quality 


dominated approximation Q2 = {a» b. c, d, z}. The process continues until all dominated 
points are eliminated. 

The above arguments are given with respect to perfectly known metric functions 77 (x). 
When experimentation is used, prediction models for these relationships are obtained and 
used in place of the true relationships. To identify the significant factor-effects, an analysis 
of variance (ANOVA) is used. With the use of orthogonal arrays, an analysis of means 
may also be used for easy computation of factor effects. The problem of dealing with 
uncertainty in the models in the discrete-factors case would require a different formulation 
from that for the continuous-factors case of the previous section. In the discrete-factors 
case, there would obviously be no region of significance about the estimated multiobjective 
optimum design, but rather a set of probable designs with an associated discrete probability 
distribution. This problem will be addressed in future research. 

5.1 Example 

An experiment was conducted (Greenall 1989), for the optimization of a manufacturing 
process for injection-moulded plastic housings. The experiment involved seven factors: 
injection pressure, injection speed, mould temperature, melt temperature, holding pressure, 
cool time, and hold time; the first factor was studied at two levels, and the rest of the factors 
at three levels each. Four response variables were considered; overall housing length (T), 
overall housing width (W), flatness (F), and surface quality (5). An Li8(2^3^) orthogonal 
array was used. Table 4 shows the orthogonal design and the summarized data for the 
four responses; the first three responses have been summarized into signal-to-noise ratios. 
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Table 5. Results of low-level optimization. 


Factors 

Levels 


Mean effects 


L 

W 

F 

S 

Injection pressure 

1 

-11.63 

-13.83 

-7.99 

1.28 


2 

-12.7 

-15.1 

-10.1 

1.80 

Injection speed 

1 

-13.28 

-12.28 

-7.03 

1.17 


2 

-10.87 

-13.70 

-8.75 

1.78 

Mould temperature 

1 

-10.93 

-12.97 

-5.95 

1.56 


3 

-13.75 

-15.68 

-12.32 

1.94 

Melt temperature 

1 

-11.95 

-14.53 

-8.57 

1.61 


2 

-13.02 

-13.68 

-9.20 

1.70 


3 

-11.57 

-15.22 

-9.40 

1.30 

Hold pressure 

1 

-9.97 

-16.70 

-9.33 

1.36 


2 

-11.90 

-12.15 

-8.95 

1.58 


3 

-14.67 

-14.58 

-8.88 

1.67 

Cool time 

1 

-13.40 

-14.07 

-10.03 

1.56 


2 

-8.88 

-13.72 

-7.30 

1.44 


3 

-14.25 

-15.65 

-9.83 

1.61 

Hold time 

1 

-12.10 

-14.03 

-9.38 

1.53 


2 

-12.33 

-14.35 

-8.90 

1.53 


3 

-12.10 

-15.05 

-8.88 

1.56 


while surface quality is an average of three independent assessments on a scale of 1 to 3 
(see Greenall 1989, for the complete data and analysis). 

The ANOVA carried out by Greenall (1989) on the signal-to-noise ratios showed that 
the additivity assumptions are fairly well satisfied; so, the two-step approach presented 
above can be applied. 

The mean responses can be computed using an analysis of means to obtain a prediction 
equation of the form (11). From table 4, it can be seen that a total of 2 x 3^ = 1458 
combinations (and, therefore, 1458 performance predictions) of different factor-levels are 
possible. A low-level optimization eliminates dominated points for each factor based on 
the mean effects (the results are shown in table 5). From table 5, it can be seen that the 
number of combinations from the remaining points is 2^ x 3^ = 648. The high-level 
optimization performed on the set of 648 reduces combinations, based on the predictions 
of performance measures using (11), resulted in 135 nondominated solutions. The final 
choice of one operating condition should be made by the design engineer. In practice, 
the design engineer can use the reference point approach and his/her preferences to find 
the most preferred solution from tliis nondominated set in an interactive manner. Let r be 
a reference point specified by the user (decision-maker). A simple strategy would be to 
find the closest nondominated point q, from the already determined set of nondominated 
solutions (denoted c f2), to r such that q = arg{minlq — rj, q e Fjv}. 

For illustrative purposes, each of the seven recommended designs obtained (Greenall 
1989) was chosen as a reference point and the nondominated solution closest to it obtained. 
The results are shown in table 6. refi (i = 1,..., 7) in table 6 denotes the ith solution 
in Greenall (1989), and soli denotes the Pareto-optimal solution found by our algorithm 
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Table 6. Final results of optimization. 




Predicted performance measures 


Levels 

L 

W 

F 

.S 

refi 

-8.91 

-6.12 

-0.99 

1.03 

1112222 

soli 

-8.91 

-6.12 

-0.99 

1.03 

1 1 1 2222 

ref2 

-11.7 

-8.83 

-7.36 

1.42 

1 1 3 2 2 2 2* 

S0I2 

-10.8 

-7.57 

-5.92 

1.75 

12 1221 1 

refs 

-lO.O 

-7.40 

-3.12 

1.55 

2 1 1 2222 

S0I3 

-10.0 

-7.40 

-3.12 

1.55 

2 1 1 2222 

ref4 

-12.8 

-10.1 

-9.49 

1.94 

2 1 3 2 2 2 2* 

S0I4 

-11.8 

-8.86 

-8.06 

2.27 

22 1 22 1 1 

refs 

-9.83 

-7.93 

-3.94 

0.59 

1 1 2 2 2 2 2* 

S0I5 

-9.77 

-7.08 

-3.61 

1.54 

2 1 1 222 1 

refe 

-10.9 

-9.22 

-6.07 

1.10 

2 1 2 2 2 2 2* 

sole 

-10.8 

-1.51 

-5.92 

1.75 

12 12211 

ref? 

-10.4 

-11.5 

-11.2 

2.55 

2232222 

S0I7 

-10.4 

-11.5 

-11.2 

2.55 

2232222 


“•■•Dominated solution.s in Greenall (1989). 

using refj as the reference point. It can be seen that four of the original solutions, marked 
by *, are dominated by solutions found by our method. In other words, four of the original 
solutions in Greenall (1989) are not Pareto-optimal. 

It is clear that an ad hoc approach, such as that used by Greenall (1989), can run into 
difficulties, if the search space of the problem is very large. The two-step approach, on 
the other hand, successfully finds all 135 nondominated solutions and provides a sim¬ 
ple method for the engineer to specify preferences. The reference point itself may or 
may not be attainable; one can enter any point that reflects one’s preferences, and the 
algorithm would always find the ‘closest’ nondominated point. In the case when little 
knowledge or preference information is available for the problem (as would happen dur¬ 
ing the initial stage of design and optimization), one can use the ideal point as the reference 
point. 


6. Summary 

This article has reviewed the use of experimentation to determine the best operating points 
for a manufacturing process, or the best design for a product’s parameters, so as to optimize 
one or more quality criteria. After illustrating the steps in robust design used to lower 
variability, it specifically examined the optimization problems arising wheii several models 
for quality characteristics are estimated from experimental data. In the case of continuous 
factors, a new approach was discussed for dealing with the uncertainty associated with 
the use of response surface models for the quality metrics. This approach prevents faulty 
inferences from the optimization step and gives the designer or process engineer a means for 
determining whether to conduct further experiments or to accept the optimization results. 
In the case of discrete factors, an efficient search technique for the multiobjective optimal 





sign was presented. The problem of developing a scheme for dealing with uncertainty 
the discrete-factors case is a potential subject for future research. 
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Abstract. With the increase in the drive towards greater customer satisfac¬ 
tion, there has been a proliferation in the variety of products offered by man¬ 
ufacturers. Variety provides considerable choice to customers and encourages 
manufacturers to modularize their product lines. However, associated with a 
broad product line are complexities related to design, forecasting, coordination 
and operations. In this paper, we discuss some of these important issues. 

Keywords. Product variety; feature-based product lines; operations manage¬ 
ment; forecasting; planning; computer industry. 


. Introduction 

1 a global economy, manufacturers are driven towards offering broader product lines 
s they have to cater to the needs of customers with very different requirements. This 
as led manufacturers to offer customizable products in different configurations utilizing 
ommon building blocks also known as feature-based product lines. Although offering 
lore variety in the product line increases the number of market segments that are covered 
Bagozzi 1986), it also increases the complexity of the manufacturing process, thereby 
icreasing operational costs (Abeggelen & Stalk 1985). Increase in variety also increases 
le number of components and sub-assemblies that are utilized. For example. General 
Motors Corporation offered 131 different rear-axle assemblies with the intent of providing 
ariety in pickup trucks (Fonte 1994). Such increases in variety lead to greater complexities 
1 forecasting, parts planning, final assembly and delivery of products. 

In a feature-based product line, each product is defined in terms of configurations which 
re built from a feature-set offered by the manufacturer. This provides customers with the 
bility to have their choice of features while defining the product configuration. In addition, 
itense competition between manufacturers has made the time to deliver, or responsiveness, 
n important factor in determining their success. As a result, customers have more choice 
1 terms of product features and have greater control over lead times that they can expect 
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keeping costs under control they have to deal with uncertainty m customer demands and 
respond quickly once customer demands are known. Since the number of configurations 
demanded by customers is enormous, there are additional challenges to the manufacturer 
in demand forecasting and production planning. 

In order to overcome these challenges, manufacturers are adopting several new tech¬ 
niques. These include incorporation of operational constraints during design of the prod¬ 
uct, delaying differentiation of products during assembly, integrating information within 
the organization and across the supply chain, clustering products into product families, 
exploiting commonality in components and manufacturing processes and using better 
decision-support tools for forecasting and parts planning. For example. Ford Motors has 
started designing cai*s for a global mai'ket while keeping in mind the various options that 
it may need to provide to different customer zones (Treece et al 1995). Hewlett Packard 
and IBM are delaying product differentiation in order to provide variety in their product 
line while keeping costs under control (Swaminathan & Tayur 1996). Toyota has been 
clustering products into families to exploit commonality in order to develop cost effective 
designs (Gupta & Krishnim 1995). While these techniques have shown considerable suc¬ 
cess in particular cases where they were employed, in a broader perspective it is essential 
to understand challenges that may arise when variety in the product line is increased. 

In this paper our primai 7 focus is on the computer industry and we highlight some of 
the important issues related to forecasting, parts planning, final assembly and distribution 
that arise as a result of increase in product variety. The rest of the paper is as follows. In 
§ 2, we describe feature-based product structure in greater detail with an example. In § 3, 
we describe issues related to forecasting. In § 4, we describe some of the issues related to 
parts planning, final assembly and interplant coordination and in § 5 we briefly describe 
other issues related to managing feature-based product lines and provide our concluding 
remarks. 


2. Feature-based product lines 

In this section, we introduce the concept of a feature-based product line through an example 
from the personal computer industry. Traditionally, in the personal computer industry 
manufacturers have been offering predefined computer models to customers. Over time 
as a response to customer requirements the number of models grew enormously. For 
example, at the beginning of this decade IBM offered a few hundred models of personal 
computer. Such large numbers of predefined end-products resulted in complexities in 
forecasting and planning and inefficiencies in operations. This led computer manufacturers 
to experiment with new product offering strategies. One such strategy is to offer a feature- 
based product line. As opposed to offering numerous predefined models to customers, a 
feature-based product line permits customers to define their choice of configuration based 
on hardware and softwai’e features provided by the manufacturer. In general, a feature 
is a subassembly or subsystem to which a function can be ascribed from a customer 
perspective. Examples of features are: a graphics hardware kit, token-ring card, 1 GB 
direct-access storage device etc. 
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sub-assemblies, parts 


Figure 1. Customer-orderable items. 


\ customer’s order can be made up of a collection of entities drawn from system 
ts, subsystems, peripheral, other features and parts (see figure 1). A system unit is 
ined by a customer’s choice of feature categories such as processor, hard-drive, memory, 
ver-supply, communication and multi-media features (refer to figure 2). Certain rules 
observed in the specification of a system unit. For example, a customer may not be 
Dwed to choose a low speed processor and a very high capacity hard-drive. A feature- 
;ed approach provides adequate flexibility in defining a system unit. For example, a 
itomer may not buy a hard-drive with a system unit, but may have it installed separately, 
me features like power supply and processor are mandatory in defining a system unit 
ile others like memory are optional. In each feature category, a customer makes a 
^cific choice from the list of options provided. For example, there are hundreds of options 
lilable to choose to satisfy communications and multi-media capability requirements. 
Each option in every feature category has a unique bill-of-materials graph as shown in 
ure 3. For example, part A is an assembly that is made up of 1 unit of part J, 2 units of 
t K and 1 unit of part L. Similar to a conventional bill-of-materials (BOM), the BOM 
a feature-based product line has the characteristics of substitution among parts and 
ectivity windows for parts, two important factors that add complexity to planning. A 
)Stitute part is one which can be used when the primary part is unavailable. There may 
more than one substitute part for a primary part and these may be primary parts in a 
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different section of the bill-of-materials structure. An effectivity window associated with a 
part specifies the time window during which this part can be used in manufacturing. Outside 
this time window, other parts specified in the bill-of-materials definition should be used. 
An effectivity window (defined later) for a part is usually introduced due to considerations 
related to engineering changes and change of supplier. 

Feature-based product offering also exists in mid-range and main-frame product lines. 
In such product lines, a major source of demand for computer hardware and software is 
the customer’s need to upgrade to higher capacity or the latest technology. The demand 
for upgrades can be greater than fifty percent and as high as ninety percent. This results in 
the need to offer subsystems and other modular enhancements to existing systems. 


3. Implications for requirements forecasting 

Plenty of research has been done on forecasting of production and inventory requirements 
for manufacturing industries. A good review of forecasting techniques can be found in 
Fildes & Beard (1992). The authors point out that serious gaps exist in the knowledge 
necessary to design an effective forecasting system for a production enterprise. Two key 
points are made in this regard. First, an effective approach to forecast demand for a product 
has to utilize information specific to the product. Secondly, information on which a forecast 
is based should include data in addition to time-series history, such as orders, marketing 
plans and product life-cycle. We find these conclusions to be very appropriate in the case 
of feature-based products. 

Increased variety in product offering has important effects on product development, 
product upgrade possibilities and customer orders. In the following subsections, these 
effects and their consequences on demand forecasting are described. 

3.1 Product transition 

The product development function in a corporation has the responsibility to develop profi¬ 
table products that closely match customer requirements. This is done based on market 
research and leads to strategies for introduction of new products and withdrawal of old 
products, often referred to as product transition. The function and form of new products 
as well as the timing of introduction are some of the decisions made and conveyed to 
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3 forecasting and business planning units. Since the development of a product involves 
ordination and completion of hundreds of activities, the actual product content realized 
d the month or quarter of a year when a product becomes available for volume production 
s subject to a high degree of uncertainty. Increase in product variety makes each product 
velopment project very complex, thereby aggravating the difficulty in estimating the 
suiting product content and time of volume production. This results in inaccuracy of 
recast for existing products since time and extent of cannibalization of their demand by 
w products are subject to uncertainty. In addition, sustaining existing products beyond 
eviously planned months or quarters could require new parts suppliers or an increase in 
ant capacity, which may be difficult to manage. 

These difficulties resulting from uncertainties in product development can be handled 
' quantitative models. The product development project should be explicitly represented 
a stochastic process made up of several parallel and sequential activities. On analyzing 
ch a process, different possible completion dates for the product development activity 
d their probabilities can be derived. Several demand forecasts for the new product and 
isting product can then be predicated by these completion dates. 

2 Increased upgrading possibilities 

5 indicated earlier, a significant portion of demand for mid-range and main-frame systems 
ises from the customer’s need to upgrade existing hardware capacity or upgrade to new 
:hnology. This results in demand for subsystems, e.g. storage systems and processors, 
addition to what may be demanded with the main computer system. Upgrading also im- 
ies the need for compatibility of subsystems with existing hardware. This often results 
additional parts which constitute an upgrade kit. The accuracy of forecasting demand 
r upgrades can benefit from approaches that consider current installed base of computers 
the market place and the upgrading possibilities for the installed base of computers. In 
se of high-end or main-frame systems, where the volume is low, projection of demand 
r upgrades could be performed by considering each existing customer individually and 
entifying upgrade possibilities depending on the customer’s current product configura- 
)n. Since there may be several upgrade possibilities for each customer, a probabilistic 
odel of upgrade possibilities is appropriate. When the customer population is large, as in 
id-range computers and high-end workstations, the customer base should be segmented 
cording to existing configurations before projecting demand. It is to be noted that even in 
traditional product offering, customers could upgrade their hardware. However, such an 
)grade is usually from one predefined end-product to another. As a result the complexities 
forecasting mentioned above are not prevalent. 

3 Customer requirements 

•aditionally, a few models were offered to end-customers and forecasters could focus on 
ejecting demand for each of them. Each model represented an adequately differentiated 
nctionality and as a result, segmentation of customer population by functionality resulted 
disjoint sets. However, in the case of feature-based product lines the offering is described 
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referred to as common building blocks (figure 2). With the usage of building-blocks in 
product offerings, segmentation of customer population by functionality is not precise. At 
the same time the number of final product configurations is too huge to develop individual 
forecasts with high accuracy. This has led to an alternate strategy in firms such as IBM PC 
Company where forecasts for individual building-blocks are generated instead of forecasts 
for customer demanded configurations. This process is well suited for features such as 
microprocessors and disk-drives which are included in most customer orders. For example, 
in the personal computer industry forecasters use the demand for individual commodities 
such as Intel’s 486 processors and 4 M DRAM chips. Both end-product forecasting and 
building-block forecasting have their own advantages. The advantages of end-product 
forecasting are the following; 

(1) It eases mapping to end-customer population segmentation; 

(2) Characterization of demand dependencies among subsystems and sub-assemblies is 
easier; 

(3) It eases the financial planning and monitoring at assembly plants who perform their 
calculations at the end-product level; 

(4) It facilitates predicting customer service at the end-product level for a pre-specified 
safety stock allocation. 

On the other hand, building-block forecasting has its own advantages which are the 
following: 

(1) The forecasts at the building-block level are more reliable because they are aggregated 
across all the products; 

(2) Industry level forecasts are easily available for commodities or building-blocks. 

Approaches that combine the advantages of end-product forecasting and commodity 
forecasting do not exist and need to be developed. This is particularly essential in businesses 
which sell whole-systems, subsystems, sub-assemblies and parts. 

4. Operational challenges 

An increase in product variety and customization leads to a proliferation of components and 
sub-assemblies as well as increases complexity in the manufacturing process. Empirical 
research indicates that the degree to which the above factors influence costs, depends on the 
manufacturing process and techniques utilized. For example, Kekre & Srinivasan (1990) 
and Fisher et al (1993) indicate that broader product lines do not increase manufacturing 
cost significantly. On the other hand, MacDuffie et al (1996) and Banker et al (1990) find 
that product variety and complexity have a significant impact on operational costs. Kekre & 
Srinivasan (1990) attribute the difference to adoption of better manufacturing techniques 
including set-up reduction, just-in-time manufacturing, increasing commonality in the 
product line and providing operational flexibility. 

In order to make their operations competitive, manufacturers need to pay closer attention 
to issues related to parts planning, final assembly and distribution. We highlight some of 
these issues in the following subsections. 
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4.1 Parts planning 

Parts planning deals with developing detailed production and procurement plans for all the 
components and sub-assemblies that go into the final product. The parts-planning problem 
in a conventional product line, where products are pre-defined by components has been 
addressed by a number of researchers in academia and industry. When demands for end- 
products are deterministic, the problem of determining which products and how many of 
such products to build given constraints on component supply has been formulated as a 
linear program (Dietrich et al 1995). When demands are stochastic, the problem has been 
addressed by formulating it as a stochastic program and solving it by different techniques. 
Exploiting the commonality of components across assemblies through risk-pooling is an 
important feature in the stochastic problem. Industrial size problems have been addressed 
by the solution approach described by (Srinivasan et al 1992), where heuristic solutions 
for the stochastic problem have been proposed. 

In the case of feature-based products, it will not be known a priori which configurations 
the customers will order. The number of feasible configurations could be too huge for 
demand to be specified for each configuration separately. This issue exists in the case of 
detenninistic demands as well as stochastic demands. In the deterministic case the demand 
may be specified by a single number, total volume, for the entire product line. In addition, 
a ratio will be specified for each feature in the product line; the demand for the feature is 
then obtained by multiplying the total volume by the ratio. For example, one might have 
information that a hard disk of type A is used in 60% of orders for machine type 1111. 
This information in itself is not adequate because a hard disk of type A may be ordered 
with a higher likelihood if the product configuration has a processor of type X. Since such 
a specification does not explicitly show any interdependencies which may exist among 
features, this demand representation is incomplete. Consequently, additional assumptions 
are needed to determine parts requirements. Approaches to perform parts planning for 
feature-based product lines in deterministic and stochastic contexts are currently being 
pursued at IBM Research. 

4.2 Assembly planning 

Manufacturers face a tough challenge in providing a large variety to customers under 
conditions where demand is uncertain and a quick response is needed once it is known. 
Some of the leading manufacturers in the computer industry including Hewlett-Packard 
and IBM are trying to delay differentiation of the final product. Delayed differentiation 
(also called postponement) involves storing inventory in semi-finished forms and finishing 
the product configuration by adding components once demand is known. For example, 
HP decided to store inventory in the form of partially configured printers and add power 
supply and documentation at local distribution centres in Europe. This led to a great 
deal of reduction in inventory costs mainly due to risk pooling effects (Lee & Billington 
1993). 



40 


Ramesh Srinivasan and Jayashankar M Swaminathan 


customer orders for final configurations were known with certainty, additional components 
were added to the vanilla box and the product was shipped to the customer. An additional 
constraint here was that the product had to be delivered to the customer within a certain 
time limit from the time the order was placed. In such a situation, it was a challenging task 
to determine (a) how to allocate the assembly capacity between assembling vanilla boxes 
and final products; (b) how many types of vanilla boxes to keep (how many different partial 
configurations), what features to include in them and how much inventory to build each 
period; (c) how to allocate the existing vanilla boxes to satisfy current demand for final 
products. Swaminathan & Tayur (1996) provide an algorithm that can solve industry size 
problem within reasonable time. Their study shows that vanilla assembly process performs 
very well when product demands are negatively correlated and there is a high degree of 
commonality in the product line. Delaying differentiation at the final assembly stage helps 
in improving the performance of the manufacturer as compared to both assemble-to-order 
(all the assembly operations are done only after demand is known) and make-to-stock 
environments under the above conditions. 

An important concern while adopting delayed differentiation is related to the correlation 
between demands for final products as well as the correlation between the demand for 
features. It seems that an ideal vanilla box is made up of features that are positively 
correlated in order to support a set of products whose demands are negatively correlated. 
Another important concern while trying to adopt such a strategy relates to capacity available 
at the final assembly stage. If additional capacity can be acquired at reasonable cost, then 
it may be optimal to operate under an assemble-to-order environment. A make-to-stock 
environment is preferable if the number of end-configurations is limited and demands 
between the products are independent or positively correlated. Swaminathan & Tayur 
(1996) analyse only a single level of BOM for end configurations. The more detailed 
problem involves consideration of features and options within each of these features. 
Research is in progress to address such problems. 

The effectiveness of providing product variety depends to a great extent on the man¬ 
ufacturing process. One of the prime concerns here relates to set-up reduction. If the 
set-up for changing from one type to another type of the product is low then one could 
effectively manage production of a variety of parts or products. For example, Whitney 
(1993) describes how set-up reduction through better design improved the effectiveness 
of operations at Nippon Denso while offering more types of radiators. 

4.3 Inter-plant coordination 

Coordination between different plants of a supply chain is a challenging task. Outsourcing 
leads to complications in terms of coordination with suppliers as discussed in the section 
on parts planning. In addition, production and distribution of parts, sub-assemblies and 
products has many interesting aspects. Cohen & Lee (1988), Cohen & Moon (1990), 
Newhart et al (1993) have discussed supply chain coordination issues under deterministic 
scenarios and a global optimization criterion. Lee & Billington (1993) and Pyke & Cohen 
(1993, 1994) consider stochastic environments and provide approximations to optimal 
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Most of the above work relates to product structures where the bill of material is fixed 
and known in advance. As a result, final assembly, packing peripherals and adding doc¬ 
umentation can be coordinated in advance based on the type of product. However, in a 
feature-based product line, it is a complicated task to plan the merger of the subsystems 
comprising the order with other peripherals and documentation since the contents of the 
orders are not known a priori. In addition, it is more difficult to synchronize the arrival of 
the product and peripherals at the customer site since additions to the product may come 
from other manufacturing units. Manufacturers like IBM have primarily considered two 
alternatives. In the first alternative the product and peripherals are merged in transit which 
implies that they are synchronously received at the receiving dock. Alternatively, they are 
brought separately to a consolidation centre where they are merged. Consolidation centres 
generally have a cross-docking operation where safety stocks are not stored. In the first 
alternative, the challenge lies in effective transfer of information between various units in 
the supply chain so that the order can be completed in transit. In the second alternative, the 
challenge lies in effective coordination of receipt of parts at the consolidation centres so 
that the amount of time spent in the consolidation centre is minimized. It is to be noted that 
the primary difference between the two alternatives is the absence of a physical location for 
consolidation in the first case. For a large organization, there are also concerns regarding 
whether to run the consolidation centres on their own or get all the distribution services 
from an external vendor. 

In addition to consolidating the product, there are also issues related to how products 
should be delivered from distribution centres to retailers. Issues related to where inventory 
should be stored and who should bear the financial burden for it are also very important 
(Anupindi & Bassok 1995). As is evident from the above subsections, operational chal¬ 
lenges faced by a manufacturer are tougher while managing a broader product line. We 
have only highlighted some of the issues and have neglected those related to the shop 
floor such as process control, line-balancing, scheduling and batching. These issues are 
equally important and provide their share of difficulties while managing a broader product 
line. 

5. Conclusions 

In this paper we first introduce the notion of a feature-based product line through an exam¬ 
ple from the computer industry. Subsequently we have highlighted important challenges 
that manufacturers have to face in the areas of planning and operations management. In 
particular, we discussed issues related to forecasting, product transition, upgrading prod¬ 
ucts, parts planning, final assembly and interplant coordination. We also mention some of 
the important problems that manufacturers face for which accurate solution methodologies 
are not available today. 

Our focus in this paper was primarily on planning and coordination issues. However, 
there are other issues related to offering a broad product line some of which are mentioned 
below: 

• Pricing decisions are extremely important for managing any product line because that in 

most part determines the demand generated for products. Pricing decisions are difficult 
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in a feature based product line because of the difficulty in segmenting the market based 
on functionality. 

• Coordination between marketing and manufacturing in order to decide on the right set 
and number of features to be promoted in the product line. 

® Modularity and commonality in the product line offer additional challenges while trying 
to integrate operations aspects in the early part of the design because one has to consider 
the impact of changes in design of a part on the whole product line rather than a single 
product. This could lead to changes in design specifications for even external suppliers. 

• In high-technology industries there is an additional challenge related to choosing the 
right design for modules because the product life-cycles are short and a design that could 
be compatible with more than one generation of products may benefit from economies 
of scale. 

• Number and location of components and final assembly plants and distribution centres 
need to be carefully detennined because of increase in product variety and customization 
of different products. 

Increased product variety benefits customers, while at the same time helping manufac¬ 
turers to adequately differentiate their products. However, in order to successfully offer 
product variety a manufacturer may have to effectively overcome many of the challenges 
identified in this paper. 


The authors thank the referees and the Guest Editor for comments and suggestions that 
have improved the quality of the paper. 
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Abstract. We present here some recent results in single-product, capacitated 
production-inventory systems in discrete time. The key results are: (1) structure 
of optimal policy for single stage systems; (2) analysing via a shortfall process; 
(3) using simulation to optimize; (4) an approximation using tail probabilities. 
We consider periodic demand, and multiple stages - serial, distribution and 
assembly. Related topics of re-entrant flow shops, lead time quotation and value 
of information are also discussed briefly. 

Keywords. Multi-echelon systems; inventory; optimal policies; simulation; 
approximations. 

1. Introduction 

The motivation for this stream of research has come from problems faced by a diverse set of 
companies, such as IBM, Sematech, AMD, Allegheny Ludlum, GE, Proctor and Gamble, 
Westinghouse, Intel, American Standard and McDonald’s. Smaller local (to Pittsburgh) 
companies such as Sintermet, Blazer Diamond, ASKO and Northside Packing have also 
provided several interesting issues to pursue. At the heart of many of the problems is the 
interaction between demand variability and non-stationarity, available production capacity, 
holding costs of inventory (at different locations), lead times and desired service levels. The 
central goal of this research stream is to understand the interactions in simple single and 
multiple stage settings and to provide insights and implementable solutions for managing 
inventories in a cost-effective manner for complex systems. The goal of this paper is 
to introduce in a systematic manner some recent advances in ‘Discrete-time, Capacitated 
Production-Inventory Systems facing Stochastic Demands’ and we limit ourselves to single 
produc! t! setting. 

1.1 Quantitative models for supply chain management 

A modem manufacturing network, consisting of multiple manufacturing facilities and 
several external vendors, can be modelled as a multi-stage, capacitated, assembly system; 



Figure 1. A typical supply chain; IBM assembly. 

see figure 1 for a representation of an IBM supply chain. Until 1991, the only major 
result that was available in capacitated systems was the structure of the optimal policy 
for a single product, single stage system facing a stationary demand process (Federgruen 
& Zipkin 1986a). Even for this case (‘a simple model’), no computational method was 
available to compute the optimal parameters for a given instance. Since then, significant 
progress has been made in this area. In some sense, these models form the backbone of 
quantitative modelling for supply chain management. 

Among the many papers that are now available (since 1991), five papers on this topic - 
single-product, capacitated systems in discrete time - make the following contributions: 
(1) Develop a method to compute the parameters for this simple model; (2) find the optimal 
policy and provide a computational procedure for the case when this system faces a non¬ 
stationary (periodic) demand process (this paper generalises) (Karlin 1960; Morton 1978; 
Federgmen & Zipkin 1986; Zipkin 1989; Ciarallo et al 1994; Morton & Pentico 1995); 

(3) study the stability of a multi-stage capacitated system operated by a base-stock policy; 

(4) develop a computational method to compute good parameters for a multi-stage system 
operated by a base-stock policy; and (5) develop a very quick and accurate approximation 
method for the same problem as above. Re-entrant flow hops, multi-product systems, 
component commonality (and delayed differentiation) and other topics have been studied in 
greater detail since that time. Similarly, several papers and research! t! hemes in continuous 
time models are available as well. 

No attempt has been made to provide a comprehensive literature review; however, most 
references may be obtained from the papers mentioned here. For a thorough survey of 
results (mainly in uncapacitated systems), see Van Houtum et al (1995). Other useful 
surveys and books include Graves et al (1992) and Buzzacott & Shantikumar (1993). 
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2 Basics 

our models, we will assume time buckets. Each bucket can be a day or a week long 
r a month long) depending on the situation. These models will be called discrete time 
odels. Each unit of time will be called a period. This approach is appropriate at the plant 
d system levels. Continuous time models are appropriate at shop-floor and machine 
/els. 

A base stock policy with order up to level z means that we produce in any period just 
ough to reach this target. If we cannot reach it, due to capacity limitations or lack of raw 
aterial, we do the best we can. An echelon base stock policy is exactly the same, except 
at all quantities (mainly inventories) considered are cumulative in order to include this 
ige and all stages downstream (near the end product, close to the customer). 

We will study these systems via a combination of analytical methods and simulation, 
odels are good for simple situations and to grasp concepts. To compute numbers for real 
Drld situations, simulation is preferred. The assumptions made are more realistic and 
e solutions obtained are more believable. Furthermore, certain flexibility that decision 
akers would prefer to have is better handled by simulation. In terms of acceptance by end 
ers, a validated simulation has had better luck than complicated mathematical models, 
shortcoming of simulation as compared to mathematical models is that it takes a much 
nger time to find answers. What we do then in reality is use models to get rough estimates 
id to provide intuition to fellow team members; then choose an alternative that shows 
ost promise; finally, we simulate to get accurate solutions. 

3 Literature survey: Papers before 1991 

lark & Scarf (1960) developed a periodic review inventory control model for a serial 
stem without setup costs. By using a discounted cost framework, they established that 
i order up to policy at each node is indeed optimal. Federgruen & Zipkin (1984) extended 
ese results further. Muckstadt et al (1984) conducted a computational study using the 
lark & Scarf (1960) model. A continuous review version of the Clark and Scarf model is 
udied by Debodt & Graves (1985). An in-depth analysis of an assembly structure with 
ily two inputs, again by using the discounted cost framework is presented by Schmidt & 
ahmias (1985). Rosling (1989) showed that under some initial conditions, an assembly 
'Stem can be reduced to a serial system with modified lead times so that the results of 
lark & Scarf (1960) may be applied to this equivalent serial system. 

A model of a production and distribution network in which manufacturing is modelled 
/ a single node is described by Cohen & Lee (1990). It also differs from much of the earlier 
ork in an important way in that decentralized control is assumed and the model itself 
a framework that combines separate models of production and distribution. A supply 
lain planning model that can be used to study production scale and scope economics is 
esented by Cohen & Moon (1990). 

A model for supply chain management which assumes decentralized control at each node 
the manufacturing network, controlled by periodic review order up to inventory policies 
presented by Lee etal (1991). Once the service levels are set for each node, the overall 
lationships between cost and service can be obtained by applying this model. Although 
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capacity considerations are not addressed, they allow for uncertainty in the supplier lead 
times. 

The literature on inventory control systems and production-distribution systems is 
extensive and hence we limit our review to the work that is closely related to the theme 
of this paper. Similarly, there is a vast body of literature on single location production- 
inventory systems (addressing many aspects of interest) that is not reviewed here. 

The Clark & Scarf (1960) model and many of its extensions, including the one by Rosling 
(1989) analyse the model within a discounted cost framework. These results are fairly 
involved and further, the computational procedures are not easy to describe or program. 
The assembly system inventory control problem in an average cost framework is studied 
by Langenhoff & Zijm (1989) and Kamesam & Tayur (1993). This analysis leads to 
an exact decomposition of the assembly system into several single location problems. 
Even this decomposition is not easy to handle, but Van Houtum & Zijm (1990) describe 
computational approximations that lead to a simplified computational procedure. 

Except for that by Federgruen & Zipkin (1986), not much work was done in capacitated 
systems in discrete time. In this survey paper, we will begin with their model and then 
describe recent developments of several extensions of this basic model. The notation we 
use may change between sections to remain consistent with the papers that are being 
summarized. 

2. Single-stage, single-product models 

The first progress since 1991 was the introduction of shortfall \ and the connection that 
was made between the capacitated inventory model operated under abase stock policy and 
a dam model that has been studied extensively by applied probabilists. 

2.1 Basic model 

Tayur (1993) provides a method to compute the optimal policy for a basic inventory 
problem addressed previously by Federgruen & Zipkin (1986a). We are to determine the 
base stock level of a single item at a single location under periodic review, when 

• the unit variable cost of purchase is c per item and there are no fixed costs; 

• the holding cost (h) and stockout cost {p > c) are per period and per item; 

• demands in successive periods are non-negative i.i.d. random variables with known 
distributions (labelled by d)\ 

• there is an infinite horizon and the costs are not discounted; 

• all demands that are not satisfied by stock on hand are backordered; 

• there is a finite production capacity, C, in every period; 

• the cost is computed on the amount of inventory or backorder at the end of each period; 


^Our thanks to a referee for pointing out that this shortfall type connection was known to queueing theorists before 
1991. 
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« we are to minimize the expected cost of holding plus penalty per period. 

Recall that inventory position is defined as (stock on hand) + (stock on order) — (back¬ 
orders). An order up-to (or a base stock) policy with a critical number z is one in which 
the inventory position (x) is raised to z if x < z, and no production is done if x > z. 

Unlike previous approaches, we provide a different construction of the sequence of 
problems that converge to the problem of interest. In particular, we do not consider finite 
horizon problems of the capacitated problem and then take the limit as the number of peri¬ 
ods go to infinity. Rather, we have a sequence of uncapacitated, multi-stage, serial infinite 
horizon versions that converge to the desired system. We use results from uncapacitated 
multi-stage serial systems coupled with results in storage stochastic processes. 

Specifically, our steps are the following. First, we show that the inventory model of 
interest is equivalent to a problem in dams. This suggests an analysis based, not on the 
evolution of (inventory on hand minus backorder) process but, rather, by considering a 
shortfall process. The shortfall is defined as the amount on order that has not yet been 
produced because of the capacity constraint. If X,i is the shortfall at the end of period /?., 
then Xfj = max(0, dn — C), where is the demand in period n. 

It is important to differentiate between backorders and shortfalls: the former represents 
what the customer did not obtain, while the latter represents what the manufacturer could 
not produce because of the capacity constraint. Thus, the backorder at the end of j^eriod 
n equals max(0, Xn — z) where z is the order up to level. The penalty cost p is on the 
backorder; there is no direct penalty on shortfall. Similarly, the amount of inventory at tlie 
end of period n is max(0, z — A'„). The cost in period n, therefore, equals pmax(0, Xn — 
z)-t-/ 2 max( 0 , z - A,i). 

Second, we show that we can replace the single-stage capacitated inventory model by 
constructing a specially structured uncapacitated infinite-stage inventory model: This is 
simply a mathematical artifact. The sequence of multi-stage problems alluded to above 
will converge to this infinite stage system. 

2.2 Connection with a dam model 

Figure 2 shows the sample path of a typical single-stage capacitated inventory system 
under periodic review that is operated by a base stock policy where excess demand is 
backlogged. The capacity (C) is 30, the order-up-to level (z) is 45, and the inventory at 
time 0 (/q) is 10. Let d^ = 15, di = 9, d^ = 37, d^ = 21 be the demands in the first 
four periods. Figure 3 shows the sample path of a dam (see Prabhu 1965, 1980) that has 
an infinite height, a release capability of at most C, and an initial water level of 35. Let 
the rainfall in the first four periods be 15, 9, 37, and 21. The dam releases as much water 
as it can, and if the water level is less than C, the dam becomes empty. The equivalence 
of the two sample paths is obvious. Let (< 3 ) 4 . stand for max(a, 0). If Zn is the content 
of the dam in period n just after release, then it satisfies = (Z„_] — C)+ 

and if Xn is the amount on order in period n that has not yet been produced, it satisfies 
Xn — (Xn~ \ -\-dn-] — C)+ (a similar recursion arises in the study of aD/G/1 queue also). 
Note that {Xn,n = 1,2,...} is a Markov chain. This motivates us to study the capacitated 
inventory system in terms of the process Xn, and provide results in terms of the steady 
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The order-up-to level is 45 
The capacity is 30 

Negative inventories represent backorders 

X, is the amount in period t that is on order but not yet produced 

Downward sloping arrows represent demands; upward arrows denote production 

Figure 2. Sample path of inventory model. 

State distribution of X — lim„_»co Table 1 summarizes the equivalence between the 
capacitated inventory model and the dam model. 

2.3 Computing the optimal base-stock level 

Let K{x) be the distribution of the input to the dam in any period. If F(x) is the steady 
state distribution of the water content in the dam just after release, then the optimal value 
of the order up to level, z, in the capacitated inventory system satisfies 

{F ^ K)iz) = p/ip -^h), (1) 

where * represents convolution. Fix) is known for all discrete distributions of the demand, 
and for Erlang distributions. Intuitively, we are adding the following two independent 
random variables: (1) demand in a period and (2) the amount on order at the beginning of 
the period that has not yet been produced. The necessary and sufficient condition for the 
distribution Fix) to exist is that the expected demand (input to the dam) in a period be 
less than the capacity, C. Intuitively, the result states that it is the sum of two independent 
random variables (demand in a period and the amount on order not yet produced at the 
beginning of that period) that adds up to z. Penalty p is incurred if this sum crosses (at the 
end of the period) z and is proportional to the excess, and a holding cost (/i) is imposed if 
the sum is less than z and is proportional to the amount on hand at the end of the period. 

Example 1. If the demand is exponentially distributed with mean rate X iKiu) = 1 - 
u > 0; Kiu) = 0, otherwise) and the capacity is C, then the steady state distribution 
of the water content in the dam just after release is given by 
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WATER CONTENT 



Upward arrows represent rainfall 
Downward arrows represent release 
One cannot release more than the water level 

The danger level is simply shown to connect with the inventory model 

Figure 3. Sample path of dam model. 


where p is the largest positive root of the equation 
p = X~ 

Thus, the optimal order upto policy has a critical number and is obtained by solving 
iF>^K)iz*) = p/(p + h). 

Independently, the similarilty of the basic inventory model operated via a base stock 
policy to a D/G/1 queue is recognized by Van Houtum & Zijm (1994). 

2.4 Optimal policy for an extended model 

In the previous subsection, we computed the optimal base stock level. We now want to 
show that among all policies, a base stock policy is optimal. Federgruen & Zipkin (1986a) 
show this for the stationary case above. Kapuscinski & Tayur (1996a) provide a proof for an 
extension that allows for periodic demand as well as capacities by considering the following 
variant of the basic single-stage, single-item, discrete-time production-inventory model. 
The demands (stochastic) follow a periodic pattern with a period K. As before, there is 
a maximum production capacity (C) in any period; demands not satisfied in a period are 
backlogged to the next; there is holding cost Qi) per unit of inventory per period and a cost 
of penalty (p) per unit of backlog per period. We want to find policies that minimize the 


Table 1. Comparison of dam and inventory models. 


Dam model 

Inventory model 

Maximum release 

Capacity 

Water content 

Amount not yet produced (shortfall) 

Empty dam 

Order-up-to level achieved 

Rainfall 

Demand 

Danger level crossed 

Backorders 
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finite-horizon costs, the infinite-horizon discounted cost and the infinite-horizon average 
cost (respectively) of operating this stage. 

For all the three cases - finite-horizon cost, discounted infinite-horizon cost and infinite- 
horizon average cost - we show that an order up-to (or base-stock, critical-number) policy 
is optimal. This extends the results of Karlin (1960b) and Zipkin (1989) for uncapacitated, 
non-stationary models and Federgruen & Zipkin (1986) for capacitated, stationary models. 
Our proof for the finite-horizon case follows standard steps. The proof of optimality for 
the infinite-horizon discounted case is simpler than that provided by Federgruen & Zipkin 
(1986) because we are able to use more recent results from Bertsekas (1987). To provide 
the optimality proof in the average cost case, we use the framework of Federgruen et al 
(1983), but our approach is different from that used by Federgruen & Zipkin (1986a) for 
the stationary case. All proofs can be found in Kapuscinski & Tayur (1996a). 


2.5 Sequence of results leading to optimal policy structure 

The following is the sequence of events at the beginning of a period: (1) some inventory or 
backlog exists; (2) a decision to increase the inventory is taken (limited by the production 
capacity); and (3) demand arrives. Holding or penalty costs are charged on the inventory 
after demand arrives. The notation is mostly standard. We have suppressed the time sub¬ 
script in X, y and d below and these are assumed to be reals unless mentioned explicitly 
as integers. We will write them when necessary. We define 

• x: inventory at the beginning of a period; 

• y: inventory after ordering, but before demand arrives. 

We assume that Edi < oo for all period types i = , K. 

2.5a Finite horizon: As is standard practice, we begin with the finite-horizon case, 
which is not only the simplest situation to consider but also sets the framework for the 
infinite horizon cases. 

Let 0 < ^ < 1 be the discount factor. Define (recursively) i'n(x) = minimum total 
expected discounted cost with a time horizon of n periods. Note (in this subsection only) 
that we start in period n and count downward towards period 1, the end of the horizon. 
The demand in period n is one of the K period types (/ = !,... K). We may assume that 
period 1 has type 1 demand and period n = K 1 has type 1 demand again and so on. 
Thus: 

iJo(-) = 0, 

Vn(x) = inf {c(y - x) -b L„(y) -f fEd„Vn-.i(y - dn)], 

(x,y)eA 

for X e R and n >1, 


A = e R^\x < y <x + C} = U 7(x) 

xeR 


where 
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is the feasible set. For x e R, 

Y{x) = [x,x + C], 
and for y e R, 

Ln(y) = Ed„{h(y - r4)+ + p(y - dn)~). 

We can express Vn using additional functions and In as follows. 

^n{x) =—CX Inix), X e R, n > I, 
ln{x) = mini Jn(y) : e FCj:)}, x e R, 

Jniy) = cy + Lniy) + fiEvn-]iy - dn), y e R, 

with uo(-) = ^o(-) = k(-) = 0. 

Our first lemma generalizes theorem 2 of Federgruen & Zipkin (1986b) to the cyclic 
case. The proof is a direct adoption of theirs. 

Lemma 1. The set of feasible pairs (x, y), A (as defined above), is convex. For all n € N: 

(a) The expected sum of holding and penalty costs, Lniy), convex; 

(b) Jn, Vn, <^nd In are convex; 

(c) Vn > 0; and 

(d) For n > 1.- Vn(x„) ->• oo when |jr„l —> oo, and if p > c then Jniyn) oo when 
ly„l 00. 

Theorem 1. Let y* he the smallest value minimizing Jn. The optimal policy in period n 
is order up-to y*. As lim\y^^\-^oo Jniyn) = oo, we have y* < oo. 

As a first property, we have the following. 

Property 1. For any x e R, VnK+ii^) is increasing in n. 

Proof. We show by induction that Vm+K C-^) > Vm (.^) for all x and for all m. First, Vf;ix) > 
0 = i'o(jc). Let VK+mix) > Vmix) for a certain m and all x. Then, 

min idy - x) -h LK+m-i-liy) + l^EvK+miy - ^K-hm-hl)} 

y€V(x) 

> min idy - x) -h Lm+iiy) + jBEvmiy - dm+i)} 
yeY(x) 

= Vm-hli^)- ° 

Note that the convexity of functions Jn, In, Vn implies continuity of these functions. One¬ 
sided derivatives exist at all points. Also, two-sided derivatives exist with the exception of 
a denumerable set of points. Points where two-sided derivatives do not exist are generated 
by mass points of demands. Although derivatives do not have to be continuous, they are 
monotonic and bounded on any compact set. Therefore, in this paper we will define them 
as right-hand-side limits. We will use a prime (') to denote these derivatives. 

We define the myopic solution to period i, yi, as the one that satisfies c L^iyt) = 0. 
Property 2 provides a simple lower bound as in the uncapacitated case; see Zipkin (1989). 
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Property 2. Assume that period ] has the minimum myopic solution; y\ = min{y/: 
i = 1,..., A'} where y/ is the myopic solution for period i. Then: y* > yi, 'in. 

Strictly speaking, our next result is not a finite-horizon result. However, this appears to 
be the most appropriate point to state it. Part (c) of the technical lemma (lemma 2) is used 
for the next result. We will use this property in proving property 6 in § 2.5a below and 
when analysing the infinite horizon average cost case in § 2.3. 

Property 3. For a given 0 < jS < 1, the sequence y* is bounded. That is, limsup{y*: 
n E A''} < 00. 

Property 4. For any finite-horizon (n periods) problem consider a policy that produces 
up-to Zi in period i = 1,..., n. For all i, the cost of such a policy is convex in z/. 

2.5b Infinite horizon: Discounted model : We now move to the infinite-horizon dis¬ 
counted case. It was natural in the finite-horizon case to label periods as time to the end of 
the horizon. In the infinite-horizon case, we typically start the process at some point and 
continue indefinitely. To make the notation more intuitive, starting from this subsection, 
we will number periods in increasing order, following period n, we have period n 4- 
Federgruen & Zipkin (1986b) showed the next result for the stationary, capacitated case. 
We provide a simpler proof as we are able to use results from Bertsekas (1987). 

Theorem 2. LetQ < < ] (discounted case). The optimal policy for the infinite-horizon 

is cyclic up-to level policy. 

Since E{di) < oo for / = 1,..., K, we have the following. 

Property 5. Let w,dx, i) := VnK-\-i(-^) fo"' ^ , K, n e Nq := N D [0], and 

X E R. The limit, lim„_^oo w«(x:, i) = w(x, i) (where values of w are in A U {oo}), exists 
and w < 00 . 

w < 00 does not imply that stationary up-to levels are finite. However, it is possible to 
prove the following (since £"(<7/) < oo for / = 1. K). 

Property 6. The up-to levels are finite (i.e. zi < oo). 

2.5c Average cost criterion: This case is the most difficult one to analyse. We need 
some technical results before the optimality of base-stock policies can be proved. 

Property 7. (Convexity) Consider the infinite-horizon case (with cyclic demands) and 
the class of up-to level policies, where levels z,- for = 1,..., AT are period-type specific. 
For both the discounted cost and average cost criteria, the average cost is finite and convex 
in each of Zi ’s. 

Let Zniax and Zmin be the maximum and minimum resnectivelv among the levels for a 



sianing pomis x-^ ana respectively in perioa iq, mere exists i^witn proDaoiiity i j a 
period n such that starting from this period the two processes coincide. 


Property 9. (Shortfall stability) Consider an up-to policy with a vector zi,..., zj;;:. If 
Edi < KC then ESi < oo for all I’s, where Si = Zi — yt is the shortfall in period 

type/. 

Average cost criterion is easy to analyse when either the number of states the process can 
take is finite or the one-period cost function is bounded (see Bertsekas 1987). Conditions 
when an optimal policy exists for semi-Markovian process with average cost objective 
function with denumerable state space and unbounded one-period cost function are derived 
by Federgruen et al (1983). These conditions were used by Federgruen & Zipkin (1986a) 
to derive optimality of up-to policy for a capacitated stationary model. We extend it to a 
cyclical model as follows. 

We first show that any policy can be dominated by a policy that requires reducing (any) 
backlog and not exceeding some stationary level A*. Then we show that among such 
policies, the up-to policy is optimal. The main structure of our proof is based on results of 
Federgruen et al (1983), but the proof tliat the required conditions are satisfied is shown 
by a different method as compared to that by Federgruen & Zipkin (1986a). 


Fact 1. If an optimal policy for the problem exists (including possibility of randomized 
policies), then it has the following form: 

(a) for X < -C, y = jc -f- C. 

(b) there exists A* < oo such that for y > A* for all period types it is better to produce 
nothing rather than take any other action. 

Theorem 3. Consider a capacitated system with cyclic discrete demands and linear 
ordering, penalty, and holding costs. For the average cost criterion, the cyclic up-to policy 
is optimal. 

Lemma2. Considerthepolicy 8[0] (produce up-to 0). Let Edi < KC andE{di)^^^^ 
< 00 for a certain k > I and for alii = \,..., K. Consider a point process defined by 
points i,for which yi =0 (no backlog of previous demands). Let N be a random variable 
equal to time between two consecutive points of this point process. Then for any starting 
period-type: 

(a) £(iV^) < oo 

(b) E(\yi\^) < oo and E(\xi\^) < oo 

(c) If 0 < ^ < 1 then £(ES=o E(Y1T=0 < ooforanyxQ. 

(d) For 0 < < 1, there exists A e R, such that 7„(0) < A„. 
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2.6 Basic properties 

We show several properties of the optimal policy including the following; (1) capacity 
smooths the base-stock levels in a manner that is different from that due to holding costs; 
(2) the limit of finite horizon order up-to levels are bounded; (3) the optimal levels are higher 
than the minimum of the K myopic levels; (4) in an infinite-horizon average cost case, the 
optimal levels are lower than the maximum of the K stationary optimal levels and higher 
than the minimum of the K stationary optimal levels; (5) if demands are stochastically 
larger or the capacity is lower, the base-stock levels are higher; and (6) for K = 2, as 
the penalty cost is increased, the difference between the maximum and minimum levels is 
bounded by C under fairly general assumptions on demand distributions. 

2.7 Computational technique 

Exact computation of optimal levels by analytical formulas appears difficult. We provide 
a simulation based method using infinitesimal perturbation analysis (IPA) to find these 
levels. The basic idea is simple: instead of using the derivative of the expected cost in a 
gradient search method, we use the expected value of the sample path derivative (obtained 
via simulation). To validate this approach and prove the optimality result for average cost 
case, we derive several technical properties of base-stock policies - convexity, regenera¬ 
tion, coupling and stability. See Glasserman (1991) for an excellent reference and a later 
subsection for details of a multi-stage system analysed in this manner. A numerical study 
indicates that our IPA method is robust and finds solutions within a few minutes on a 
workstation. The steps are similar to those described later in § 3 for a serial system, and 
so we do not detail them here. See Kapuscinski & Tayur (1996a). 

2.8 Insights into some complex issues 

We also numerically study several issues that provide insight into the behaviour of optimal 
solutions. Examples of issues studied include the following: (1) what is the increase in 
cost if all periods are forced to have the same base-stock level? (2) what are the benefits 
of changing capacities in each period based on the demand type? (3) how many periods 
are affected by smoothing of the levels? (4) how are the above results affected by high 
penalty cost or high utilizations? (5) what is the relationship between service level and 
costs? Several of the qualitative and technical properties in our capacity setting differ from 
the uncapacitated non-stationary model. 

See Kapuscinski & Tayur (1996a) for extensive computational testing. The basic insights 
ai'e as follows. 

(1) Basic insights. Several basic properties of the system confirm what we expect. All 
other things being equal: 

• with an increased mean demand, the up-to levels increase, 

® with decreased capacity, the up-to levels increase, and 

• with increased variance of demand, the up-to levels increase. 
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(2) Effect of capacity and range of demands on ‘smoothing By smoothing, we mean that 
values Zi are affected by demands and the capacity of period j i. The difference 
between and is lower as compared to the difference between the maximum 
and the minimum of K independent stationary capacitated models; so some higher 
levels are brought down and some lower levels are lifted up. We have two types of 
smoothing working in opposite directions: (a) in anticipation of high demand, some 
levels ai’e lifted up, and (b) in anticipation of low demand, other levels ai'e decreased. 
The second type was described by Karlin (1960b) and Zipkin (1989) in the uncapaci¬ 
tated setting. The first one is induced by finite capacity, and the second by the holding 
cost. 

At times there are changes in the ordering of the levels as capacity decreases. For ex¬ 
ample, the period with the minimum level may be different depending on the capacity. 
Furthermore, although the differences between levels do not disappear as capacity is re¬ 
duced, the levels increase and the ratio between the maximum and the minimum of the 
levels gets closer to 1. 

(3) Constant base-stock versus optimal (period wise) hase-stock. We compare the cost 
of the optimal constant base-stock policy to the optimal cost. (Note that the cost is a 
convex function of this constant level and the simulation based method remains valid.) 
Not surprisingly, the optimal constant z lies between the minimum and the maximum 
optimal base stock levels When capacity is tight, the cost ratio (say R) becomes close 
to 1. We note that in some cases the difference Zmax — 2min increases and then 
decreases as capacity decreases, while in other cases it only increases with capacity. 
Even in cases where Zmax — ^min of optimal policy increases as capacity decreases, 
R monotonically decreases to 1. One explanation is the fact that the cost function is 
relatively flat around the optimum (see item 8 below), and more so as the capacity 
becomes tighter. 

The effect of variance on R is as follows. R is close to 1 in most cases. A case when the 
cost ratio is large has two small but different variances (example: A^(70, 1), N(10, 10^) 
in a K =2 situation). With increase in vaiiance the levels increase and the ratio of costs 
goes to 1. We can also show that even with significantly different means, the ratio of costs 
goes to 1 when at least one of the variances is high. 

(4) Changing capacity with periods according to period type. Rather than remain constant, 
capacity can vary according to period type. (The cost is still a convex function of 
(zi, ..., zk) and the optimal policy is still order up-to. The simulation-based method 
remains valid and recursions can be easily adapted.) Several capacity allocations for 
four situations each with K = 2 were tested: (i) both period types have low vaiiance; 
(ii) period with lower mean has a high variance; (iii) period with higher mean has a 
high variance; (iv) both periods have high variance. We are able to test if two natural 
alternatives to constant capacity (with the same total capacity KC in o. cycle) - (a) 
proportional to mean demand and (b) proportional to (mean -f constant * standard 
deviation) of demand - do well. 
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cases, but in these cases nearly all spreads did well. In case of high variance, a proportional 
(based on mean and variance) spread did much better than one based on mean alone, but 
surprisingly it was possible to find cases when it was optimal to allocate more to a period 
type with a demand having both a smaller mean and a smaller variance. We think that in 
these cases there is typically sufficient capacity in low demand period to raise the inventory 
to optimal level in higher demand period, but the effect of a surge in demand in high demand 
period can be repaired faster when a little bit more capacity is assigned to low demand 
period. This is seen very clearly in the following example: let = 4 with demands wi! 
th! means and variances equal to (20,1), (20, 1), (90, 50), (20, 1) respectively. We find 
that the optimal capacity allocation is (22,22, 93, 143). So the use of the term ‘recovery 
capacity’ seems appropriate. In all cases, allocating equal capacity to periods did very 
well. 

(5) Service level In all the experiments -constant capacity, equal base-stock levels, period- 
wise capacity allocation, different AT’s, different demand distributions, different hold¬ 
ing and penalty costs - the type-1 service, defined as P (inventory after demand > 0), 
at optimality, was p/p h. See Tayur (1993) for a proof of this connection between 
service level and costs in the single-stage capacitated stationary case, and Glasserman 
& Tayur (1995) for the multistage stationary capacitated case. 

(6) Increasing penalty cost. With increased penalty the up-to levels increased. Obviously 
larger differences in means and variances caused relatively larger differences in up-to 
levels, but this effect was small over a range of penalty costs. For most part, the levels 
rise in parallel. Furthermore, in all our experiments, the ordering of the levels did not 
change as a function of p (unlike item 2 above). We also noticed that the difference 
between the maximum and the minimum levels at optimality was bounded by C for 
/if = 2 as we increased p. Property 12 below provides an explanation. 

(7) Mean utilization V5. variance. As penalty cost is increased, the optimal up-to levels of 
low variance system with high utilization increase faster than in a system with high 
variance and lower utilization. This indicates that the rate of increase of base-stock 
levels depends on both the variance and excess capacity. 

(8) Cost sensitivity. Cost is not very sensitive to the up-to levels around the optimum. This 
was noticed in the stationary multi-stage system studied by Glasserman (1991) also. 

3. Single-product, serial system 

It has been shown that for a serial capacitated system, the optimal policy (under the cost 
criteria discussed above) is not base-stock in general; see Speck & Van der Waal (1991). 

3.1 A simulation-based optimization procedure: Single-stage simulation 

Let s be any base stock level; the optimal value for s is what we eventually need. The 
notation used is consistent with the papers referenced on this topic where detailed proofs 
are available. 



Figure 4. Illustration of a single stage. 


We need notation. 

/„ = inventory - backlog in period n\ 

Rfi = production in period n\ 

I = leadtime from production to inventory; 

Tn = Rn-\ + • • • + Rn-t 

= pipeline inventory; 

Dfi = demand in period n\ 
s = base-stock level; 
c = production capacity. 

We always assume that c > 0 and > 0 for all n. Under a (modified) base-stock policy, 
the production level in each period is set to try to restore the inventory position /«-f- Tn — 
to 5. If production were uncapacitated, this would be achieved by setting Rn = T>n- Since, 
however, cannot exceed c, it may take multiple periods of production to offset demand 
in a single period. 

The system evolves as follows: 

~ Dn “b Rn—i^i 

R„=cA[i + D„-(/„ + r„)r, 

Tn+1 = + Rn Rn—C- 

denotes max(0, x), x~ denotes max(0, —x) and a Ab denotes min(fl, b). 

What about costs? In period n the cost will be * "^he average cost over 

the long term will be Cn)/N for a large A^. is the number of periods simulated. 

3.2 A simulation-based optimization procedure: Single-stage gradient 

We can certainly find the gradient of expected cost by simulating one more time and 
changing (say, increasing) only the order up to level by 1; the difference in cost between 
the two simulations is the gradient. This would be too much work especially if we had 
several stages whose order up to levels we want to optimize. Can we get the gradient by 
not doing any more simulations? Yes; and here goes the basic trick. 

(1) If we had inventory in period n, having started at a higher base stock level means an 
increase in holding by h. 

(2) If we had backlog in period n, we will decrease penalty by p. 


( 2 ) 

( 3 ) 

( 4 ) 
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Thus, the changes are either h or —p. We will just average these changes. The point is we 
are claiming that average of changes is the change in the average. In this situation, and in 
several others, this claim is good. 

How do we make the computer do this? In principle, we are Just differentiating the 
recursions. The easiest way is to write the following two lines in the code to be executed 
in every period: 

dCost = dCost + /z, if /« > 0, 
dCost = dCost — p, if In < 0. 

Then dividing dCost by N provides the derivative. Note that at the beginning of the simu¬ 
lation dCost is set to zero. 


3.3 A simulation-based optimization procedure: Serial system 


3.3a Operation : We now link multiple stages in series. There are m stages: stage 1 
supplies external demands and stage i supplies components for stage i — I, i = 2 ,..., m. 
Stage m draws raw materials from an unlimited supply. To specify an echelon-inventory 
base-stock policy for the system, we let 

s^ — echelon base-stock level for stage i. 

Naturally, we require s^ < s^ < • • • < 5 '”. Let the variables and c' have the 

same meaning as before, applied to stage /. For i — 2,..., m, let 7;^ be the installation 
inventory at stage i, and let /J = /„, with /„ as in § 2.1. In period n, stage i sets production 
to try to restore the echelon inventory position, 

+ Ti) - D„. 

7=1 

to its base-stock level 5 '. 

Two features distinguish the multi-echelon system from a single stage: production at 
stage i, i < m, is constrained by available component inventory 7^+^ as well as by the 
capacity limit c'; and for / > 1 the amount removed in period n from the store at stage i 
is the downstream production level 7?^"^, rather than the external demand. Thus, for stage 
i = 2,..., m — 1 we have 


ln+l=In-K~'+K-t:’ 


RL = c’ A 


5 ' + D„- £(// + Ti) 


7=1 


+ 


A 7, 


/+l 


'ri _ Tri I ni pi 


(5) 

( 6 ) 
(7) 


At stage m, raw materials are unlimited so the last term in (5) is absent. To subsume these 
special cases in (5-7), we take 7?® = Z)„ and 7^+^ = 00 for all n. To complete our 
specification of the model, we need initial conditions; for simplicity, we take 11 — s^, 
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= s^ — s^~^, i = 2,..., m, and all other variables zero. In other words, the system 
urts with full inventory. For details see Glasserman & Tayur (1995). 

Similar to the single stage case, the derivatives with respect to the base stocks can be 
mputed. 

1 Validation of technique 

lidation of finite horizon derivatives - inventory and costs - is quite straightforward, 
e show that (right-side) derivatives exist with probability one at a given value of .9, then 
peal to Lipschitz continuity and finish by applying the dominated convergence theorem, 
e Glasserman & Tayur (1995) for details. 

5 Stability and recurrence 

r IPA to work in the infinite horizon, several conditions have to be satisfied by the 
derlying stochastic process. These are derived by Glasserman & Tayur (1994). When 
pacity limits are introduced, an ineffective policy may lead to increasingly large or- 
r backlogs: the stability of the system becomes an issue. In this paper, we examine 
s stability of a multi-echelon system in which each node has limited production ca- 
city and operates under a base-stock policy. We show that if the mean demand per 
riod is smaller than the capacity at every node, then inventories and backlogs are 
ible, having a unique stationary distribution to which they converge from all initial 
ites. Under i.i.d. demands we show that the system is a Harris ergodic Markov chain 
d is thus wide-sense regenerative. Under slightly stronger conditions, inventories re- 
m to their target levels infinitely often, with probability one. We discuss cost implica- 
)ns of these results, and give extensions to systems with random leadtimes and periodic 
mands. 

5 Extensions 

it us discuss two obvious extensions. 

5a Assembly system : In an assembly system, each node i requires components from 
set of predecessor nodes. These are assembled into stage-/ finished goods. By changing 
lits, if necessary, we may assume that components from predecessor stages are assembled 
equal quantities. 

To keep the notation simple, we consider a representative example, rather than the 
neral case. Figure 5 depicts a three-node system in which node 1 assembles components 
pplied by nodes 2 and 3. Node 1 feeds external demands; the other nodes draw raw 
aterials from infinite sources. The evolution of inventory at node 1 is characterized by 

^'+1 = 4 ' - 

i?; =C> A [s' + D„ - (/„> + p)]+ A A ll (8) 

T’l _T’l I pi pi 

le assembly feature is reflected in the dependence of on and 1^ in (8). Nodes 2 
id 3 are characterized by the basic recursions (5-6), with obvious modifications to the 
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indexing. The only notable difference is that now and 7,^ are decreased by the same 
production level each period. 


3.6b Distribution system Another variant of the serial system allows intermediate stages 
to supply multiple lower-echelon stages, typically in a tree topology. Our results extend 
without difficulties to such models. For ease of exposition we describe a less general setting 
- a serial system in which each stage faces external demands for components in addition to 
internal demands from the downstream stage. A manufacturer of electronic equipment, for 
example, may face demands for integrated circuits, and for circuits assembled into circuit 
packs, along with demands for finished goods. 

To characterize the operation of such a system, we need to specify how each stage 
allocates inventory to internal and external demand. Rather than restrict ourselves to any 
one policy, we describe a class of policies consistent with our results. Let be the stage-j 
inventory reserved for external demands at stage / in period n, and let = //j — be 
the inventory available for downstream production. Suppose stage i has base-stock levels 
Sq and to supply external and internal demands, and let s^ = .v^ -f s \. Denote by DJ, the 
external demand at stage i in period n. 

The operation of stage 1 is unchanged. At stage /, production is now set according to 


= c‘ A 




7=1 


1 + 


/./ _ T’) 


A / 


f + l 
]/) • 


Also, 


fl _ ji n 

^n+\ ''n 


i-\ 


[A'A4r + ^;,_,- 


Pipeline inventory follows (5), just as before. It only remains to specify how and 
are determined. 

A broad class of policies sets 





si, + D'„ - H, 


On 
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and /q^ = , for some function g'; in other words, the inventory reserved for internal 

production is a function of the available inventory and of the shortfalls in meeting internal 
and external demands. The choice of g' determines the particular policy. 

We briefly describe two types of policies. The first type gives strict priority to internal or 
external demands, allocating as much inventory as needed to the high-priority demand and 
using any excess for the other. The second type attempts to balance shortfalls, allocating 
inventory to minimize the resulting difference between inventories and base-stock levels. 
This type of policy implements relative priorities through the choice of s’q and s\ . The 
assumption that the system can, in fact, be balanced in each period is essential to many 
analytical approaches; our setting, of course, does not require this. For both types of policies 
just described, the particular functions g' are tedious to write out but they are sufficiently 
smooth to allow derivative calculations. 

3.7 An approximation using tail probabilities 

The IPA method could be computationally prohibitive for large problems, or for problems 
where the starting solution for the simulation is far way from its eventual optimal. So a 
quick (and accurate!) approximation is presented by Glasserman & Tayur (1996): Recall 
that our objective is to find base-stock levels that approximately minimize holding and 
backorder costs. The key step in our procedure approximates the distribution of echelon 
inventory by a sum of exponentials; the parameters of the exponentials are chosen to 
match asymptotically exact expressions. The computational requirements of the method 
are minimal. In a test bed of 72 problems, each with five production stages, the average 
relative error for our approximate optimization procedure is 1.9%. 

Much of the technical development of this approximation is based on the work by 
Glasserman (1994), who uses techniques from Asmussen (1987), 


4. Brief summary of related topics 

(1) Re-entrant flow shops. How do we handle multi-product re-entrant flow lines, a very 
typical topology in semi-conductor fabrication facilities? (Re-entrant flow lines have 
attracted significant interest from the research community in recent years because of 
their direct applicability to semi-conductor fabrication.) This question was motivated 
by a fab visit at AMD and is studied by Bispo & Tayur (1996). Along the way we 
are able to study serial multi-product capacitated systems. The framework is similar 
to that described in § 2 above, but requires a far more detailed analysis. 

(2) Lead time quotation. To compete effectively, suppliers are realizing that three aspects 
of lead times are important to customers - short, accurate and consistent - and that 
this ‘service’ has to be provided at minimum cost. Several improvements along the 
just-in-time (JIT) philosophy (and quality paradigms) have made production processes 
fairly reliable at many supplier plants. While negotiations with customers have reduced 
the variability of order streams, these variabilities continue to remain significant and 
further reductions seem unlikely given that the customers themselves face an intensely 
competitive marketplace with increasingly fickle and demanding (end) customers. 
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We concentrate on (in Kapuscinski & Tayur 1996b) the uncertainty of demand from 
multiple customer classes (\ < i < M) and consider production times to be deterministic. 
It is costlier to quote a longer lead time to a customer from a higher class than from a 
lower one; in particular, the cost is linear with rate m,- for class i customer. We impose 
the constraint that we will ship the order to the customer with a 100% reliability within 
the quoted lead time; we may ship earlier at no penalty, but we gain no benefit from the 
customer for this early shipment. Our original motivation to study this problem came 
from a company that produces ‘rolls’ that are used in steel mills. The goal was to find 
an easily impleinentable lead-time quoting rule, preferably graphical, that the sales force 
could use. 

Briefly, we consider a finite-horizon, discrete-time production-inventory model with a 
single stage, single product that faces a stochastic demand from many customer classes in 
any period. Processing time is detemninistic. In each period, after the demands are realized: 

(a) we quote lead times to these demands, 

(b) we make production decisions for this period, and 

(c) we ship some material (sometimes earlier than its due date). 

Items (2) and (3) above determine the inventory level at the end of the period, on which we 
pay a holding cost of h per unit (per period). Note that quoting a large lead time for a low 
margin customer (and so having a lot of capacity in anticipation of future higher margin 
customers) causes a penalty from this low margin customer, while a short quoted lead time 
would force a higher margin customer amving in the near future to wait a little longer. This 
is the basic trade-off. We find a policy that is simultaneously optimal for 100% reliable 
quotation, production, inventory management and shipping. We show that for some cases 
the policy is easily implementable. 

A simple model relating lead times, inventories and batching is studied by Karmarkar 
(1987); however, no quotation of due dates is considered there. 

(3) Co-operation in supply chains. Industrial supplier-customer relations have undergone 
radical changes in recent years as the philosophy behind managing manufacturing 
systems is influenced by several Japanese manufacturing practices. As more organiza¬ 
tions realize that successful in-house implementation of just-in-time alone will have 
limited effect, they are seeking other members of their supply chain to change their 
operations. This has resulted in a certain level of co-operation, mainly in the areas of 
supply contracts and information sharing, that was lacking before. This is especially 
true when dealing with customized products, and is most commonly seen between 
suppliers and their lai'ger customers. 

We incorporate (in Gaviraeni et al 1996) information flow between a supplier (or pro¬ 
ducer) and a customer in a capacitated setting of a simple supply chain. The customer 
faces i.i.d. end-product demand, and the supplier has a finite capacity in each period. We 
consider three situations: (1) a traditional model where there is no information to the sup¬ 
plier prior to a demand to him except from past data; (2) the supplier has the infoiTnation 
of the {s, S) policy used by the customer as well as the end-product demand distribution; 
and (3) full information about the state of the customer. Each of these leads to different 
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n-stationary demand processes as seen by the supplier. Study of these three models 
ables us to understand the relationships between capacity, inventory and information at 
; supplier level. We show that order up-to policies continue to be optimal for models 
th information flow for the finite horizon, the infinite horizon discounted and the infinite 
rizon average cost cases. We develop a quick recursive solution procedure to compute 
j optimal parameters when capacity is infinite. For the finite capacity case, we develop 
d validate Infinitesimal Perturbation Analysis (IPA), as well as show' how the solution 
■ the uncapacitated system can be easily modified to obtain approximate values. Using 
;se solution procedures we estimate the savings at the supplier due to information flow 
d study when information is most beneficial by varying capacity, holding costs, demand 
;tributions and S — s values. 


Summary 

veral significant advances have occurred in the study of capacitated systems since 1991. 
lis paper informally provides an introduction to topics, and some techniques. The ref- 
jnce list below is not exhaustive at all, but should provide a good starting point. At 
ist two very interesting developments (within a single product setting) are not consid- 
id in this informal review; (1) supply contracts and (2) international supply chains. See 
heller-Wolf & Tayur (1997), for example. 
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On the optimality of exhaustive service policies in multiclass 
queueing systems with modulated arrivals and switchovers 

Y NARAHARl^ and N HEMACHANDRA 

Department of Computer Science and Automation, Indian Institute of Science, 

Bangalore 560012, India 

e-mail: [hari,hemaj @csa.iisc.emet.in 

Abstract. Consider a single-server multiclass queueing system with K classes 
where the individual queues are fed by X-correlated interrupted Poisson 
streams generated in the states of a AT-state stationary modulating Markov 
chain. The service times for all the classes are drawn independently from 
the same distribution. There is a setup time (and/or a setup cost) incurred 
whenever the server switches from one queue to another. It is required to min¬ 
imize the sum of discounted inventory and setup costs over an infinite horizon. 
We provide sufficient conditions under which exhaustive service policies are 
optimal. We then present some simulation results for a two-class queueing 
system to show that exhaustive, threshold policies outperform non-exhaustive 
policies. 

Keywords. Multiclass-queue; MMPP arrivals; setup times; exhaustive 
policies. 


1. Introduction 

Hofri & Ross (1987) investigated the following scheduling problem in multiclass queues 
with setup times (see figure 1); There is a single server attending to two classes of customers 
from two queues fed by independent homogeneous Poisson processes. The service times for 
the two classes are drawn independently from the same (general) distribution. A switchover 
time with or without a constant monetary cost per switch made is involved whenever the 
server switches from one queue to the other. An inventory holding cost, linear in queue 
length and having the same rate at the two queues is also included. Two separate cost 
structures are considered; 

(1) Sum of discounted switchover cost and inventory holding cost over an infinite horizon 
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Buffer 1 



Buffer 2 

Figure 1. A two-class queue with independent Poisson arrivals. 


In both cases above, Hofri & Ross (1987) showed that the policies to minimize the cost: 

(1) are necessarily exhaustive, i.e., server may switch to the other queue only when the 
current one is empty, and 

(2) are likely to be threshold policies, i.e., the server switches (from an empty queue) only 
when the other reaches a critical size. 

The first result, in fact, holds for three or more queues. Also, for the two-queue case, a 
detailed queueing-based procedure was given for computing the optimal thresholds. 

In this paper, we look at the following variant of the problem: The input streams to the 
two queues are no longer independent but are correlated in a Markov-modulated Poisson 
process (Fischer & Meier-Hellstem 1993) sense. (A Markov-modulated Poisson process 
can be informally described as an arrival process in which the Poisson arrivals have their 
rate modulated by a finite state, irreducible continuous-time Markov chain. In a Jt-state 
MMPP, the anivals are Poisson at a certain rate (say. A./) so long as the CTMC is in 
state i, VI e {I,..., K}. This is a popular model for nonrenewal input to queues.) More 
specifically, we look into the system (depicted in figure 2) where the input streams are 
the two interrupted Poisson processes generated in the states of a two-state homogeneous, 
stationary modulating Markov chain. Consequently, the inputs to the queues are correlated. 
The motivation for considering input processes of this type comes from several situations 
in manufacturing and communication networks (Fischer & Meier-Hellstem 1993; Frost & 
Melamed 1994). For example, under Markovian switching, the output from a multiclass 
flexible machine constitutes an MMPP (Hemachandra & Narahari 1995). Similarly, the 
output of a failure-prone Markovian queue has the so-called on-off arrival feature - a 
Poisson output for an exponential amount of time when the machine is working and zero 
output for another exponential amount of time when it is not-working. We remark that the 
input to each class is an intermpted Poisson process which is stochastically equivalent to 
a hyperexponential renewal process (Fischer & Meier-Hellstem 1993). 

The problem and the investigations in this work are directly inspired by the work of Hofri 
& Ross (1987). Indeed, we follow the same line of argument and technical conditions 
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Buffer 1 



Buffer 2 

Figure 2. A two-class queue with modulated arrivals. 


to corroborate our results. In this paper, we focus on the discounted cost criterion and 
investigate whether the optimal scheduling policies have to be necessarily exhaustive, like 
in the case of the Hofri-Ross formulation. Investigation of the correctness of this conjecture 
leads us to a set of sufficient conditions under which the conjecture is indeed true. 

Apart from the seminal work of Hofri & Ross (1987), the literature that is relevant 
to this work is concerned with optimal scheduling in the presence of setups. The classic 
economic lot scheduling problem (Elmaghraby 1978) is the deterministic version of the 
multiclass scheduling problem with setups and has very elegant closed-form solutions. In 
the stochastic version which is more relevant here, two separate formulations exist. In the 
setup cost problem, a setup cost is incurred on each setup and in the setup time problem, a 
random setup time is incurred when the server switches class. The setup time problem is 
more realistic than the setup cost problem (Reiman & Wein 1994) but is also more difficult 
to solve. 

Several researchers have studied the setup problem with more than two classes of 
customers, using a variety of techniques such as dynamic programming and heavy traf¬ 
fic queueing theory. A comprehensive survey appears in Reiman & Wein (1994) and 
Ravikumar (1996). All the authors however assume independent arrival processes into 
the individual queues. In § 2, we present the notation for the multiclass queueing model 
considered in this paper and describe the objective function to be optimized. In § 3, we 
investigate the conjecture that exhaustive policies are optimal and this leads to a set 
of sufficient conditions under which this conjecture is indeed tme. In order to explore 
the validity of the conjecture, we carry out simulations on a two-class queueing sys¬ 
tem in § 4, which show that exhaustive, threshold policies outperform non-exhaustive 
policies. 

2. The model 

We consider a 2-class queue, representative of multiclass queues as a whole. Let the rates 
of the interrupted arrival processes to the two queues be Xi and A .2 respectively, so that 
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the overall arrival process is a two-state MMPP. Let a\ {a 2 ) be the rate trom State I tc 
State 2 (from State 2 to State 1) of the two-state Markov chain modulating the arrivals. 
Let n\ and ni be the stationary probabilities of the modulating Markov chain so that 
TT] = OLiJiot] +q; 2 ) and 7t2 = +ot 2 )- Service time distribution for a customer 

from either of the queues is tire same and let 5i, ^ 2 , 53, • • • be the sequence of service 
times (i.Ld.) for customers of Queue 1 and T\. T 2 , Ts, • • • the sequence of service times 
(i.i.d.) for customers of Queue 2. Note that {5 h} and {Tn} are from the same service time 
distribution, with finite mean. Let and be two (possibly different) sequences 
of i.i.d. positive random variables with some general distributions, corresponding to the 
sequence of setup times at Queue 1 and Queue 2 respectively. The state of the system is 
then a four-tuple (m, xi, X 2 , u), where, 

• u gives the status of the server taking values in the set {I\, I 2 , 1,2, 12, 21} where /; 
means server is idle but setup for class i(i — 1,2); 1 denotes that tlie server is currently 
serving Class 1 customers; 12 means that the server is switching from Class 1 to 2, and 
the other symbols have similar meaning. 

• Xi is the number in the queue (including the one in service, if any) i(i = 1,2), taking 
integer values. 

• V takes values in the set (1,2} giving the state of the modulating Markov chain. 


Let S be this state space. As in Hofri & Ross (1987), the decision epochs will be at service 
completions, switch completions and arrival instants. The actions at each such decision 
epoch are from the set A with A= {5, C, 7} where these symbols successively mean tliat 
the server chooses to serve, change and idle. So the problem is a semi-Markov decision 
problem with state space S and action space A given above. 

Let be the number of customers in queue i (i — 2) at time t, given that the 

initial state of the system is z, and a policy tt is used. Let 

X^'"At) = xYAt) + xYAt) 


be the number in the system at time t. Let be the number of switches made by 

the server up to time t under policy 71. Then, the expected discounted cost, with discount 
factor yS > 0, V7r (z), incurred when a policy tt is followed for a system starting in state z, 
is given by 


■ ^00 

V^(^) = E / e-^^X^'\t)d.t 

Jo 



( 1 ) 


assuming that the holding costs per customer per unit time in both the queues are the same 
and are normalized to 1 and that a is the one-time switch charge levied at the start of each 
switch. A policy is said to be optimal if it attains 


y (z) := y^/(z) = inf (z) Vz € 5. 

71 

To suit the analysis ahead, we cast the objective function as in Harrison (1975): Let 
be the number of arrivals up to time t and the number of departures up to 

time t for the queue i (i = 1, 2) when a policy tt is followed in a system starting at state 
z. Let X® and x^ be the number in Queue 1 and Queue 2 respectively at time t = 0. Then, 
we have. 
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= x'; + - Df’M/) / = i,2, 


JT,; 


( 2 ) 


so that from (1) and (2) vve have. 


. .if + a:? 7t\X\TCoX-) 

V"7r(z) = -~-^ + 






■ /-OO , 
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(3) 


where is the Laplace-Stieltjes transform (LST) of the distribution R^’^{t). Since, 


\f 

Lao 


+ D?’’(0)d/ - aR^'H^) 


= -E 




where, is the kth departure epoch from the system under policy jt, we have, 


x^ + x^ ^ Tr\X] + jTjXi 1 


V^U) = 


-h 




-jJAz), 


with Jjr (z) defined as. 


Jtt (z) — E 





The optimizing problem is now equivalent to maximizing Jniz). If J iz) is the supremum 
of Jk (z) over all policies, then by Lippman (1973), there is an optimal pure policy which 
achieves /(z). 


3. Optimality of exhaustive policies 

Let g be any non-exhaustive policy; we shall argue below that the performance of any such 
policy could be improved upon. This means that non-exhaustive policies are suboptimal. 
Thus, optimal policies have to be necessarily exhaustive. 

First, we synthesize from the existing arrival process, a new arrival process that depends 
on the scheduling policy followed. Let / be any such policy. 

_Let Ni(t) and N 2 (t) be the arrival processes to Queues 1 and 2. Assume that N 

A 2 (Q) is stochastically equivalent but independent of A(r) := (Ai(0, A/ 2 ( 0 ).Let 
(lf2, T, P) support the independent processes A(r), N{t) and the sequences {5^}, {T^}, 
(C'l.nj, {E 2 ,n}- As in Hofri & Ross (1987), form a new arrival process from these N{t) 
and Nit) by collecting customers from Nit) or Nit) as described below. Let r-f be the 
first instant that the server begins to serve a customer from Queue 1 under policy /. Let 
be the service time of this customer from Queue 1. Define a new arrival process A-f (?) 
as: 

Afit) = Nit), 0<t <Tf, 

= Nir^) -)- Nit — r-f), r-f < t < z-^ + 5i, 

= NiSi) + Nit -Si), t >Tf + Si. 


( 4 ) 
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That is, collect customers from Nit) until then on from N{t) until + 5'i; and 
revert back to indefinitely after -{-Si. Note that is defined pathwise and its 
construction depends on policy /. 

Since g is non-exhaustive, there exists some state, z, such that either g causes the server 
to idle at a nonempty queue or abandons a nonempty queue. Without loss of generality, 
assume that this occurs at Queue 1. Then, if g(u,x\,X2,v) denotes the scheduling decision 
in the state (u, xi, X 2 , u), 

3x1 >0 9 g(l,xi,X 2 , u) ^ 5, 

i.e., 

g(l,xi,X2, u) G {C, /} with XI > 0. 

Let this zo •= (1, xi, X 2 , u) be the initial state in the sequel. Let (f) denote the state 
of the system at time t when a policy / is followed by the system starting in a state zo- 
Write for 

We now exhibit a policy tc that outperforms the policy g in minimizing the total cost. 
Policy n says that the system beginning in state zo should first immediately serve a job 
from Queue 1 (let 5i be the service time of this customer). Then, n follows policy g for 
a random amount of time from 5] to + 5i with a lag of S] . From Zg + 5i onwards 
it simply follows policy g with no delay. So, = 0. Writing out A^(t) and A^(t), the 
arrival processes corresponding to policies n and g using (4), we have that they are related 
as; 

A^(t) = +1^) — 0<t<Si, 

= AHt - Si) + A^(Si), Si < t < + Si, 

= A^(0, r>T^ + Si. 

Note that the departure processes, D^{t) and are related as 

D^{t)=f), 0<r<Si, 

= 1 + Z)^(t-Si), Si<t<T^ + Si, (5) 

= t>r^ + Si. 

Essentially, we have states coupled (stochastically) by the policies g and n in such a 
way that 

Z^{t) = Z^{t) fort>r^ + Si. . , 

Here is the crucial assumption. 

Assumption A. Let the arrival processes A^ (t) and A^ (t) formed above be stochastically 
equivalent, so that the systems when driven by any of these processes have the same 
performance measures when the same policy is used for both the systems. 


Remark. This assumption is satisfied by independent Poisson processes as noted by Hofri 



L,ei (ij at least one oj me values a, it [Cij, it tC 2 j t?e strictly greater man zero ana {it} 
the processes A^(t) and A^{t) he equal in law. Then, any non-exhaustive policy g is 
non-optimal. 

Proof. Using (5) above, we show that the difference, 

Jk(zo) - Jgizo) 


y3E 

f e-^‘[D^{t)-DHt)]dt 

- afE 

f e~^^[dR^(t) ~dRHt)] 


jt>0 


Jt>0 


is positive. This difference has two terms on the RHS, the first one capturing the holding 
cost and the other the switching cost. 

Let <T^ be the first time that g switches from Queue 1 when the initial state is zq. On 
each of the sets 

Ai := {r^ < oT'^}, A 2 := = 00 }, A 3 := {r^ > cr^ < co}, 

we compute this difference assuming that E [C 2 ] > 0 or a > 0. 


Case I (t^ < a^). Here we have R^(t) = P^(t), t > 0 and hence the difference in 
switching costs as captured by the second term above is zero. Since 

D^(t) = DH0 + l, tG[Si,r^ + Si], 

= DHt), t ^ [Si, T^ + Sil 


we have, 



e-^'[D^(t)~D^(t)]dtlAi 


= E[E[^-^‘^>(l-^"^"')iAi,r^]|Ai] 
= E[(1 -e"^'^")E[e“^^MAi,T^]|Ai] 
= E[e-^^^]E[(\ - e-^^')\Al] 


Now, E[e ^'^ 1 ] =: S{f) and since is strictly positive a.s. we have this term greater than 
zero and thus ( 6 ) is positive in Case 1. 


Case 2 ~ 00 ). Policy g simply idles forever and hence the difference in 

switching costs is zero and the difference in holding costs is given by 



e-l>‘[D’’(t) - DHt)]dt\A2 


= E[e-^^MA 2 ) = 5(« 


SO that, here also (6) is positive. 
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Case 3 > ct^ < oo). Many scenarios are possible here with policy g deciding to opt 

for a switching before taking up a job from Queue 1 (in finite time a.s.). In all of them, 
however, R^it) lags behind (t) by S\ till + Si and from then on they are the same. 
Here, we show that the difference (6) is strictly positive by showing that the first term 
(holding cost difference) is strictly positive if E [C 2 I > 0, and the negative of the second 
term (switching cost ditference) is also strictly positive, if a > 0. 

First, it follows as in Hofri &. Ross (1987) again, that the negative of the switching cost 
difference in (6) above can be lower bounded, 


-a^E 



- 

e~^^[dR^(t) -dRHMAi 






and hence, is strictly positive if a > 0. 

If E[C 2 ] > 0, it follows that the holding cost difference in (6) is strictly positive as 
shown below. Write 


■ /“OO 

DHt)]dt\A^ 

L7o 

= |^^[P(D''(0 > k,^\A^) - PiD^t) > k, 4>|A3)]j dr 


where 'F := (r < 5] + r^'}. Note that departure processes are stochastically equal for 
r > S\ + With Tf denoting the ith departure epoch from the system under policy 
g, (Tq := 0), set Uj := Tf - Tf__y Then, with Uk >st Si , we have. 


PiUi + U 2 ~\ — • + Ujc-\ 4- -Sj < r) > P(U\ 4- 1/2 H-b 5 0 

Vr > 0, it > 0 

P(D^(t) > ^\A 3 )>P{DHt) > k, vklAs) Vr > 0, k>0 


so that, the above is indeed positive. If, E [C 2 ] > 0, then from the fact that Ui includes the 
service time of a Job as well as switchtime or idling time, we have, 

< r, ^|A 3 ) > P(f/i < f, 'FiA 3 ) 4- €, 

for some e > 0 and t in an open interval. This means that 

PiD^it) > 1,^1A3) > P(DHt) > 1,v1/1A3) + 6, 

for some e > 0 and t in an open interval. So, finally, we have (6) strictly positive under 
the hypothesis. 


Remark 1. It is easy to see that the above arguments go through for queues with more than 
two classes; we require that at least one of a, {E[Qj]} be strictly positive. 

Remark 2. The proof above uses the stochastic coupling idea and the requirement on the 
part of the arrival processes, as set out in assumption A, facilitates coupling of the evolution 
of the systems following the policies pathwise, after the coupling epoch, 4- 5i. Since 
the technical nature of MMPP is not used in the above proof, we conjecture that the above 
result is valid for a much more general arrival process that satisfies assumption A. 
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Remark 3. Policy n is more exhaustive than policy g in the sense that it serves one job 
from the same queue in the very beginning. As the above argument is sample pathwise, 
by induction, this means that exhaustive policies are better than non-exhaustive policies. 

Remark 4. We left open here the decisions the server has to take when the queue to which 
it is attached becomes empty. One class of policies to use in this context is that of the 
threshold policies in which Hofri & Ross {1987) show that such policies are optimal when 
arrivals are Poisson. 

Remark 5. Verifying the sufficiency condition involving the arrival process (Assumption 
A) is difficult; indeed it may be necessary to pursue another track of investigation to get 
around this problem (M Hofri 1996, personal communication). 


4. Simulation results 

Since it is difficult to verify the validity of the sufficient conditions concerning the equiva¬ 
lence of the aiTival processes (assumption A), we simulated quite a few systems operating 
under different policies and looked at the cost incuiTed by them. This way, we investigated 


Table 1. 

Setup costs and inventory holding costs for a bursty scenario. 


Hi n2 

A^i N2 


SW2 

Switching cost 

Holding cost 

Total cost 

1 1 

23 

1337.6 

1252.4 

2590.0 

798.0 

3388.0 

1 2 

23 

1342.5 

1267.4 

2609.9 

1526.3 

4136.2 

1 3 

23 

1456.1 

1404.4 

2860.5 

2228.2 

5088.7 

2 1 

23 

1375.8 

1506.3 

2882.1 

1656.1 

4538.2 

3 1 

2 3 

1817.8 

1909.5 

3727.3 

2142.1 

5869.4 

2 2 

23 

1387.9 

1.532.7 

2920.6 

2329.9 

5250.5 

23 

23 

1504.2 

1675.4 

3179.6 

3008.8 

6188.2 

1 1 

22 

1342.5 

1275.4 

2617.9 

591.1 

3209.0 

1 2 

22 

1456.1 

1412.3 

2868.4 

1325.2 

4193.6 

1 3 

22 

1785.4 

1755.5 

3540.9 

2190.7 

5731.7 

2 1 

22 

1387.4 

1540.7 

2928.6 

1394.6 

4323.2 

3 1 

22 

2001.6 

2132.1 

4133.7 

1780.2 

5913.9 

11 

3 2 

1160.9 

1101.5 

2262.4 

732.5 

2994.9 

1 2 

32 

1259.0 

1224.1 

2483.1 

1507.8 

3991.0 

1 3 

32 

1401.4 

1377.6 

2779.0 

2441.1 

5220.1 

1 5 

3 2 

1401.9 

1379.2 

2781.1 

4154.4 

6935.5 

2 1 

3 2 

1112.3 

1275.4 

2387.7 

1511.3 

3899.0 

22 

32 

1225.9 

1412.3 

2638.3 

2245.4 

4883.7 

2 3 

32 

1555.2 

1755.5 

3310.7 

3111.0 

6421.7 

24 

32 

1633.7 

1835.9 

3469.6 

3972.3 

7441.9 

3 1 

3 2 

1202.3 

1357.6 

2559.9 

2116.1 

4676.0 

3 2 

32 

1318.6 

1499.7 

2818.3 

2831.4 

5649.8 

3 3 

32 

1869.5 

2065.4 

3934.9 

3628.2 

7563.1 

4 1 

32 

1816.0 

1949.0 

3765.0 

2501.7 

6266.7 

42 

32 

2187.1 

2348.6 

4535.7 

3073.3 

7608.9 

5 1 

32 

2237.9 

2343.8 

4580.8 

2907.0 

7487.8 

52 

32 

3018.9 

3158.6 

6177.6 

3428.1 

9605.6 
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Table 2. Setup and inventoiy holding costs for exhaustive policies. 


n\ nj 

A^i N 2 

SWi 

SW 2 

Switching cost 

Holding cost 

Total cost 

1 1 

1 1 

1734.3 

1683.7 

3418.0 

294.1 

3712.1 

1 1 

1 2 

1618.1 

1540.7 

3158.8 

474.4 

3633.2 

1 1 

1 3 

1606.0 

1506.3 

3112.3 

735.9 

3848.2 

] 1 

1 4 

1257.2 

1149.2 

2406.4 

964.8 

3371.1 

1 1 

1 5 

1217.1 

1096.6 

2313.6 

1256.0 

3569.6 

1 1 

2 1 

1456.1 

1412.7 

2868.8 

433.9 

3302.7 

1 1 

22 

1342.5 

1275.4 

2617.8 

591.1 

3209.0 

1 1 

23 

1337.6 

1252.4 

2589.9 

797.4 

3387.3 

1 1 

24 

1232.9 

1137.4 

2370.4 

936.8 

3307.1 

1 1 

25 

1184.9 

1075.7 

2260.6 

1233.0 

3493.7 

1 1 

3 1 

1259.0 

1224.5 

2483.5 

616.5 

3100.0 

11 

32 

1160.9 

1101.5 

2262.4 

732.5 

2994.9 

1 1 

33 

1160.1 

1082.4 

2242.5 

930.6 

3173.1 

1 1 

34 

1056.1 

970.1 

2026.2 

1008.6 

3034.8 

1 1 

35 

1008.5 

910.2 

1918.7 

1268.7 

3187.4 

1 1 

4 1 

1122.2 

1091.3 

2213.4 

940.6 

3154.1 

1 1 

42 

1024.7 

971.6 

1996.3 

1027.7 

3024.0 

1 1 

43 

1019.7 

950.9 

1970.5 

1209.7 

3180.1 

1 1 

44 

1019.2 

939.8 

1959.0 

1150.3 

3109.3 

1 1 

45 

1001.0 

905.5 

1906.5 

1324.0 

3230.5 

1 1 

5 1 

1096.3 

1067.1 

2163.8 

1035.9 

3199.3 

1 1 

52 

1016.4 

965.3 

1981.7 

1094.2 

3075.9 

1 1 

53 

1015.8 

949.0 

1964.7 

1263.5 

3228.2 

1 1 

54 

1005.1 

927.3 

1932.4 

1201.0 

3133.4 

1 1 

55 

998.8 

905.2 

1904.0 

1371.3 

3275.3 


the performance of the exhaustive policies. Also, we confine our experimentation to the 
class of exhaustive, threshold policies motivated by the results of Hofri & Ross (1987) 
concerning the optimality of threshold policies in the case of independent Poisson arrivals. 
We remark here that exhaustive, threshold policies may not be the optimal policies. 

A typical policy we considered is captured by the four tuple {n\,n 2 ,N\, N 2 ). Here, 
n\ and n 2 are idle thresholds while N\ and N 2 are switching thresholds. n\ (^ 2 ) is the 
minimum number of jobs required in Queue 1 (Queue 2) for the server to process a job 
from that queue when attached to the queue. In other words, even though the server is 
attached to Queue 1 (Queue 2) and the number of customers in that queue is less than ni 
(«2)> it will not process any job from that queue. Note that a service policy is exhaustive 
if ni = n 2 = 1. N\ {N 2 ) is the minimum number of jobs required in Queue 1 (Queue 2) 
for the server, currently attached to Queue 2 (Queue 1), to switch to Queue 1 (Queue 2). 
Ni and N 2 are called the switching thresholds. 

Note that, in view of the exhaustive nature of the policies, n takes precedence over N. 
So, in an empty system, the server when attached to a queue, say Queue 1, potentially idles 
there till the number in this queue is greater than or equal to n 1 . If, during this idling period, 
the number in Queue 2 becomes at least N 2 , it switches. Similarly, the server continues 
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le 3- Setup and inventory holding costs for a non-bursty scenario. 


Ni N 2 

SWi 

SW 2 

Switching cost 

Holding cost 

Total cost 

45 

692.3 

550.8 

1243.1 

1189.5 

2432.6 

45 

694.2 

556.6 

1250.9 

1766.7 

3017.6 

45 

735.0 

599.6 

1334.5 

2329.9 

3664.4 

45 

863.1 

732.0 

1595.2 

2814.6 

4409.4 

45 

764.0 

618.6 

1382.6 

2098.3 

3480.9 

45 

775.9 

634.8 

1410.6 

2653.2 

4063.8 

45 

800.2 

663.4 

1463.7 

3238.8 

4702.4 

45 

929.0 

796.1 

1725.1 

3702.2 

5427.4 

45 

886.5 

737.9 

1624.4 

3088.9 

4713.3 

45 

898.4 

754.8 

1653.2 

3624.8 

5278.1 

45 

999.8 

857.6 

1857.5 

4175.7 

6033.1 

45 

1129.3 

992.0 

2121.3 

4606.1 

6727.4 

45 

888.3 

737.9 

1626.2 

4027.6 

5653.8 

45 

900.2 

754.9 

1655.1 

4563.0 

6218.1 

45 

1001.7 

857.7 

1859.4 

5113.2 

6972.5 

45 

1131.7 

992.0 

2123.7 

5537.7 

7661.3 

45 

1222.0 

1049.4 

2271.4 

5140.6 

7412.0 

45 

1336.9 

1167.2 

2504.1 

5705.8 

8209.9 

45 

1501.0 

1342.8 

2843.8 

6118.0 

8961.8 

45 

1323.9 

1146.8 

2470.7 

6124.7 

8595.4 

45 

1442.9 

1270.0 

2712.9 

6666.4 

9379.3 

45 

1711.6 

1542.7 

3254.3 

6968.2 

10222.6 

44 

835.9 

699.1 

1535.1 

949.2 

2484.3 

44 

876.7 

742.2 

1618.9 

1559.7 

3178.5 

44 

1004.8 

875.6 

1880.4 

2074.3 

3954.7 

44 

1012.0 

890.8 

1902.9 

2746.0 

4648.9 

44 

917.7 

777.3 

1695.0 

1834.3 

3529.3 

54 

831.2 

698.8 

1530.0 

1062.2 

2592.2 

54 

871.5 

741.8 

1613.4 

1685.7 

3299.1 

54 

999.2 

875.3 

1874.5 

2217.1 

4091.6 

54 

1006.3 

890.5 

1896.8 

2890.0 

4786.8 

54 

835.0 

699.2 

1534.1 

1936.9 

3470.9 

55 

688.8 

550.4 

1239.2 

1268.1 

2507.3 

55 

690.7 

556.3 

1245.0 

1847.2 

3094.2 

55 

731.0 

599.2 

1330.2 

2423.3 

3753.5 

55 

691.3 

550.8 

1242.1 

2177.2 

3419.4 


Examples 

re, we look at the performance costs incurred by two systems as they follow policies 
ich are more and more exhaustive, i.e., with ni, n 2 or both, becoming less. For the 
Items we considered, the simulations show that the exhaustive policies incur the least 
;t. 


ample 1. (Bursty arrivals). Recall that a central motivation for the use of MMPP as an 
ival process is to capture bursty arrivals to a queue. We wish to look at such a scenario 
w in the context of service and scheduling policies. Let cii,oi 2 ,'k\, and X 2 have the usual 
nificance. The numerical data are: a\ = 1/10, 0:2 = 1/60, = 0.2, A .2 = 0.04. Let 

! setup time be deterministic at a value 10 for both the queues and the mean service time 
terrainistic at 0.50. The discount factor = 0.001, while the setup cost a = 250. 
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Table 4. Setup and inventory holding costs for exhaustive policies. 


m 112 

N\ N 2 

SWi 

SW 2 

Switching cost 

Holding cost 

Total cost 

1 ] 

1 1 

1486.9 

1360.7 

2847.5 

183.0 

3030.5 

1 1 

1 2 

1426.8 

1286.4 

2713.2 

334.7 

3047.5 

1 1 

1 3 

1149.1 

1001.1 

2150.9 

644.3 

2794.5 

1 I 

1 4 

1047.6 

897.4 

1945.0 

782.3 

2727.3 

1 1 

1 5 

892.4 

737.9 

1630.4 

1085.1 

2715.5 

1 1 

1 6 

813.7 

653.6 

1467.3 

1346.2 

2813.5 

11 

2 1 

1482.0 

1360.5 

2842.5 

247.0 

3089.5 

1 1 

2 2 

1422.2 

1286.3 

2708.6 

395.4 

3103.9 

1 1 

23 

1145.3 

1001.1 

2146.4 

694.0 

2840.4 

1 1 

24 

1043.9 

897.3 

1941.2 

831.3 

2772.6 

1 1 

25 

890.2 

737.9 

1628.0 

1114.6 

2742.7 

1 1 

26 

811.9 

653.5 

1465.4 

1367.2 

2832.7 

1 1 

3 1 

1236.1 

1119.6 

2355.7 

394.3 

2750.0 

1 1 

32 

1218.9 

1090.5 

2309.4 

483.1 

2792.5 

1 1 

3 3 

943.0 

806.8 

1749.8 

744.0 

2493.9 

1 1 

34 

918.6 

777.3 

1695.9 

846.6 

2542.5 

1 1 

3 5 

765.0 

618.6 

1383.6 

1110.7 

2494.3 

1 1 

3 6 

681.7 

533.0 

1214.8 

1338.9 

2553.6 

] 1 

37 

644.5 

494.2 

1138.7 

1478.8 

2617.6 

11 

4 1 

1013.5 

900.2 

1913.7 

543.8 

2457.5 

1 1 

42 

1006.3 

882.2 

1888.5 

6II.0 

2499.5 

] 1 

4 3 

876.7 

743.0 

1619.7 

824.1 

2443.8 

l 1 

44 

835.9 

699.1 

1535.1 

949.1 

2484.2 

11 

45 

692.3 

550.8 

1243.2 

1189.6 

2432.8 

1 1 

46 

615.7 

469.7 

1085.5 

1392.7 

2478.2 

1 1 

47 

579.1 

430.9 

1010.0 

1522.9 

2532.9 

i 1 

48 

576.2 

424.6 

1000.7 

1646.8 

2647.6 

1 1 

49 

499.4 

348.2 

.847.6 

1968.9 

2816.6 

1 1 

5 1 

1007.5 

899.8 

1907.3 

698.9 

2606.1 

1 ] 

52 

1000.4 

881.8 

1882.2 

764.7 

2646.9 

1 ] 

5 3 

871.5 

742.6 

1614.1 

950.0 

2564.1 

11 

54 

831.21 

698.8 

1530.0 

1062.1 

2592.1 

1 1 

5 5 

688.76 

550,4 

1239.2 

1268.0 

2507.2 

1 1 

5 6 

612.5 

469.7 

1082.1 

1473.2 

2555.4 

] 1 

6 1 

986.0 

881.7 

1867.7 

902.5 

2770.2 

1 1 

62 

979.0 

865.0 

1844.0 

957.3 

2801.3 

1 1 

63 

850.3 

726.3 

1576.5 

1131.5 

2708.0 

1 1 

64 

827.3 

698.0 

1525.4 

1150.9 

2676.3 

1 1 

65 

685.31 

550.0 

1235.3 

1352.4 

2587.8 


This system was simulated over a sufficiently long simulation run length of 10,000 
time units and the total cost (as given by (1)) was computed for different sets of values of 
n\, no, N], N 2 . It was found that the run length of 10,000 was an adequate indicator of the 
infinite horizon discounted cost since the accumulated cost became virtually saturated for 
this run length. The simulation clock was made to start ticking only after ensuring that the 
modulating Markov chain was well into the steady state. 

Table 1 shows for different values of /ij, 112 , Ni, N 2 the values of the accumulated 
discounted switching costs from Queue 1 and Queue 2, respectively, as SWi, SW 2 , the 
total switching cost (5Wi 4- SW 2 ), the accumulated (discounted) inventory holding cost, 
and the overall discounted cost. Observe that three sets of values have been explored for 
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and namely, N\ =2, N 2 — 3; N\ =2, A^2 = 2; =3, A^2 = 2. From table 1 

see that in all these three sets, exhaustive policies as given by = n 2 = 1 have the 
;t performance. 

tot, having seen that exhaustive policies perform better, we investigate the effective- 
is of threshold switching by looking at the cost incurred for different values of thresholds 
and N 2 (see table 2). For this system, the threshold N\ — 3, N 2 — 2 offers the least 
;rall discounted cost among threshold policies considered. 

ample 2. (Non-bursty arrivals). This scenario comes up when the streams of jobs have 
nparable intensities with a correlation as captured by the 2-state Markov chain. We 
fQ here, a\ = 012 — 1/60, X\ = 0.20, X 2 = 0.10. The service is deterministic at value 
0. The setup cost is 200 while the setup times are deterministic at values 5 and 5. The 
count factor ^ = 0.001, The notation in tables 3 and 4 is as in the previous example. 
)le 3 demonstrates that exhaustive policies are better when thresholds are taken as (4,5); 
4); (5,4); and (5,5). Then, fixing the policy as exhaustive, i.e., with n\ = n 2 = 1, we 
rch for the thresholds which offer the least overall discounted cost; Table 4 shows that 
thresholds Ni ~ 4, N 2 = 5 offer the best performance among the policies considered. 

Conclusions 

[■ a single-server multiclass queue fed by the streams of an MMPP, with non-preemptive 
itchovers involving a switchover cost and/or a switchover time, we argued that ex- 
istive policies are optimal as long as the arrival process satisfies a technical condition 
;arding its law and the service times of the various classes are the same. The verification 
this technical requirement on the part of the arrival process appears to be difficult. Hofri 
Ross (1987) showed that, when the queue is fed by independent Poisson streams, these 
licies are optimal; here, this technical condition is true. To test our claim, we carried out 
ite an extensive simulation of two examples in the case of a two-class queue. Exhaus- 
e policies were found to outperform non-exhaustive policies in all cases. Further, these 
lulations showed that threshold policies are the best performing among all exhaustive, 
eshold policies. Apart from having a direct argument about the optimality of exhaustive 
licies, the question of effectiveness of the threshold policies needs to be looked into in 
ure. 


is research was supported in part by the Office of Naval Research and the Department of 
ience and Technology grant N00014-93-1017. We would also like to acknowledge the 
xllent facilities at the Intelligent Systems Laboratory, Department of Computer Science 
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Abstract. The well-known priority dispatching rule MOD (Modified Opera¬ 
tional Due Date) in job shop scheduling considers job urgency through ODD 
(Operational Due Date) and also incorporates SPT(Shortest Processing Time)- 
effect in prioritising operationally late jobs; leading to robust behaviour in Mean 
Tardiness (MT) with respect to tightness/looseness of due dates. In the present 
paper, we study an extension of the MOD rule using job-waiting-time based 
discrimination among operationally late jobs to protect long jobs from exces¬ 
sive delays by incorporating an ‘acceleration property’ into the scheduling rule. 
Fonnally, we employ a weighted-SPT dispatching priority index of the form: 
(Processing time)/(Waiting time)“ for operationally late jobs, while the priority 
index is ODD for operationally non-late jobs; and the latter class of jobs has a 
lower priority than the former class. 

In the context of Assembly Job Shop scheduling, some existing literature in¬ 
cludes considerable focus around the concept of ‘Staging Delay’, i.e., waiting 
of components or sub-assemblies for their counterparts for assembly. Some ex¬ 
isting approaches attempt dynamic anticipation of staging delay problems and 
re-piioritisation of operations along converging branches. In the present paper, 
ratlier than depending on such a centralised and largely backward scheduling 
approach, we consider a partially decentralised approach, endowing jobs with 
a priority index yielding an ‘acceleration property’ based on a ‘look-back’ in 
terms of waiting time, rather than ‘look-ahead’. For the particular case, in our 
proposed mle, when a is set at zero and when all jobs at a machine are opera¬ 
tionally late, our rule agrees with MOD as both exhibit the SPT effect. 

In simulation tests of our priority scheme for assembly job shops, in com¬ 
parison with leading heuristics in literature, we found our mle to be particu- 


larly effective in: (1) minimising conditional mean tardiness, (2) minimising 
99 -percentile-point of the tardiness distribution, through proper choice of a. We 
also exploit an interesting duality between the scheduling and queueing control 
versions of the problem. Based on this, some exact and heuristic analysis is 
given to guide the choice of a, which is also supported by numerical evidence. 

Keywords. Scheduling; assembly job shops; queueing; waiting times; dy¬ 
namic priorities. 


1. Introduction 

In job shop scheduling, meeting due dates is a crucial objective. Due date related priority 
dispatching mles essentially are based on a primarily backward scheduling approach. This 
can, however, lead to some uneven loading of machines over time. This cannot be avoided 
even if due dates are internally set because in the setting of a dynamic job shop (that is to be 
governed by dispatching priority mles), no sequential due date setting procedure for jobs 
(in the order of their arrival) can anticipate all possible conflicts among jobs for resources 
over time, even in a non-probabilistic situation. 

Uneven loading of machines over time can be associated with uneven rates of job flows 
between machines and increased waiting times for jobs, due to greater variability in the 
arrival process for jobs coming to a machine. The well-known Modified Operation Due 
Date (MOD) mle proposed and studied by Baker (1984) judiciously incorporates the SPT- 
effect when there exist operationally late jobs at a machine. This leads towards locally 
maximising job flow rates past the machine, and such mobility can partially help to lessen 
the high ups and downs in machine loading over time in a purely backward scheduling 
approach based on ODD alone. Thus, although SPT is a local dispatching mle (associated 
with minimising mean flow time past a machine), the SPT-effect as built into the MOD 
mle indeed turns out to give low Mean Tardiness (MT) in job shop scheduling as such. 
The robustness of MOD (in respect of varying tightness of due dates) relates to the fact 
that ODD relates to minimising maximal operational lateness, while for operationally late 
jobs, minimising their mean flow time correlates with minimising their mean tardiness. 
Baker (1984) had also anticipated consideration of MOD mle in assembly job shops. 

In connection with our observations on the impact of SPT-effect as above in a job 
shop, it may also be stated that a study by Russell & Taylor (1985) has suggested that 
the performance of a pure SPT mle improves as the product stmcture (in assembly job 
shop) grows taller (where in fact there would exist a greater need for job mobility in the 
sense described above). Concern over the uneven machine loading due to purely back¬ 
ward scheduling strategies has also been expressed in the work of Wein & Chevalier 
(1992). 

On the whole, the incorporation of the SPT effect as in the MOD mle due to Baker 
(1984) leads to beneficial effects in job shop scheduling with due dates, and this can be 
interpreted or motivated in different ways as indicated above. 

In assembly job shops (as opposed to pure or component job shops) there is the 
phenomenon of staging delay (Adam et al 1987), i.e., waiting of a component for the 


Assembly 
Opentilon, A1 



Staging delay for A1 is: 
Difference between completion 
times of components Cl and C2. 


Staging delay for assembly 'A* 
with 'n* components with 
completion times C to C is: 

I n 

(Max (C^ -Mln (C D 

Average of staging delays over all 
assembly operations in all fobs Is 
average staging delay. ASD. 


Figure 1. Staging delay in assembly jobs. 


completion of other components or sub-assemblies with which it is to be assembled 
(figure 1). 

A number of reports in the literature have attempted to devise priority dispatching rules 
to overcome ill effects of staging delay (Huang 1984; Adam etal\9%l\ Fry et al 1989). Fry 
etal (1991) also give a succinct summary of major findings in earlier works. A common 
theme in a majority of these works involves partial anticipation of staging delay problems 
ahead in some forms and consequent re-prioritising, using rather centralised strategies. In 
such schemes, delays in some branches of an assembly can lead to some loss of priorities 
for operations along other branches, in effect promoting a tendency for delays on those 
other branches with respect to the (original) ODDs. Such proliferation of possible delays 
can lead to undesirable consequences. There appears to be no dominant strategy emerging, 
robust with respect to depth/breadth of product structures as well as looseness or tightness 
of due dates etc. 

Thus we find that on the one hand the MOD rule has not been studied in the context 
of assembly job shops, and on the other hand a variety of rules studied for assembly job 
shops in recent literature do not seem to have put forth a reliable approach to the problem 
while they attempt some anticipation of staging delays ahead in time. In our approach for 
controlling the phenomenon of staging delays, we employ a certain look-back strategy 
(rather than look-ahead), rooted in the fact that long operational tardiness of a job would 
normally be associated with a long waiting time at some machine for some operation on 
the job (possibly upstream). Before discussing the motivation and logic of our approach 
(in § 3), we first indicate our proposed rule form in § 2. 


2. Proposed rule 

Let’s define a job in general as an assembly. This is represented as a directed convergent 
tree whose nodes represent operations and arcs indicate immediate precedence. The root 
of the tree is the job’s final operation. An operation having more than one immediate 
predecessor is an assembly operation. With each immediate predecessor, say p, of an as¬ 
sembly operation, is associated a component of that assembly. Components can be defined 
as maximal linear strings of operations which are mutually exclusive and such that only 
the first operation in the string may be an assembly operation, e.g. in figure 3 below in § 4 
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the components are indicated as operation strings A, B, C and D. The notion of compo¬ 
nents is also vital to the concept of staging delay in assembly job shops, as illustrated in 
figure 1. 

Typically, several operations on different jobs may compete for the same facility or 
machine at any time. An operation can qualify for competition for a machine’s time only 
when all its predecessor operations ai'e completed. Only when this happens, is an operation 
considered to be waiting for its required machine. In this context, if an operation in a job 
is waiting at a machine, and cannot possibly be completed within the operation’s due date, 
then the said job is referred to as an operationally late job by the said machine. 

Before defining our proposed rule, we assume that the modified operational due dates 
have been defined. Baker (1984) had indicated that the MOD concept, defined first for 
serial stmctures for jobs, could be usefully extended to assembly jobs. Here, we assume 
that job due dates have been generated based on total work content along critical path 
(TWKCP) as has generally been the case in literature (e.g. Adam et al 1987; Fry et al 
1989 etc.). An operation’s due date (ODD) is analogously generated based on the job 
due date and total work content along the path from the operation to job completion 
excluding the operation in question. Given ODDs, tlie MODs are defined following Baker 
(1984). 

We propose a priority dispatching mle of the following form. 

- Operationally non-late jobs to be ranked by increasing ODD (which is same as MOD 
here) and to form the lower priority class; 

- Operationally late jobs (which are waiting since they became eligible for processing at 
the machine) to fomi the higher priority class; and jobs in this to be ranked by increasing 
value of the priority index: 

(Processing time)/(Waiting time)", 

where a is the single control parameter (positive). Mathematically, for operation i of 
job j we define (with reference to table A2 in the appendix for symbol definitions): 

• ATi, J = arrival (eligibility) time of operation (/, j) at its required machine. Also, 
MODi,j gets defined at any time (before starting operation i of j) at Tnow as the 
maximum of ODD,-and BFTij (which is equal to Tnow + tij). 

We shall refer to our mle as MODWIP - modified operational due date rule incorporating 
waiting time induced prioritization (figure 2). Extreme values of or, viz. zero and infin¬ 
ity, impart SPT and FCFS priorities respectively to operationally late jobs. Incidentally, 

<1 

Thus, 

Priority Index (Tnow - AT/y)“}, if, {i,j) is operationally late 

= ODD/ y, otherwise. 

(0 < a < oo) 


Figure 2. Modification of MOD using waiting induced acceleration. 
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= —oo would correspond to LCFS priority; but we are not interested in a <0. Some 
alysis concerning the role of a is given in § 6 later. 

This type of interpolation between SPT and FCFS does not appear* to have been studied 
the literature, although well known rules in literature include; 

) Truncation of SPT by FCFS (among jobs with waiting time exceeding a limit) which 
forms a higher priority class; 

) truncation of FCFS by SPT (when queue length at a machine exceeds a limit). 

We sought a more elegant way of interpolating between SPT and FCFS than truncation 
SPT by FCFS and vice versa. At one extreme we know that SPT minimises Favg and 
avg and at the other extreme, it is easy to see the following: 

mma 1. In static single server scheduling FCFS minimises Fmax- 

his can be seen as a corollai*y of the result that FDD minimises Lmaxl by defining the 

3 due dates as the job amval times themselves.] 

We have considered the FCFS rule as a limiting member of the family of the weighted- 
*T rules, withpi'iority indices [p/w°^] as a goes to infinity. It turns out, as we shall see in 
), that this interpolation has interesting consequences; including the fact that as a goes 
infinity the behaviour of the rule comes to an essential agreement with the behaviour 
FCFS, in the case where waiting times are very large as compared to the processing 
nes. 

Comprehensive surveys of dispatching priority rules are given by Conway et al (1967); 
ire (1966); Day & Hottenstein (1970); Panwalkar & Iskander (1977); Blackstone et al 
982) and Haupt (1989). For a comprehensive account of fundamental rules and algo- 
hms in sequencing and scheduling, readers may refer to Lawler et al (1993). 

Having defined our rule form, we now turn to some aspects of its motivation. 


Preliminary motivation for using waiting times in dynamic prioritization 

simple motivation for our rule was as follows. Among the operationally late jobs at a 
achine, the ones likely to suffer large tardiness eventually, under the MOD rule, include 
3 rather long jobs (long operation on the machine). Such long jobs would suffer a con- 
lued discrimination under SPT priority (among operationally late jobs). Our rule gives a 
iressal to such long and operationally late jobs, conditional upon their waiting time, so 
to curtail the long tail of the tardiness distribution. The concern for the long tail here is 
iviously important in controlling staging delay (where any assembly operation waits for 
rival of all of its components). 

The use of waiting time in a dispatching priority index may be said to incorporate a 
nple look-back feature, which perhaps may appear to include redundant information if 
le were to look at the overall control decision problem in the framework of a Markovian 
jcision Process. Yet the fact remains that among the class of reasonably simply defined 
spatching prioritv mles, the use of look-back information in a rule cannot be readily 
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effectively in the pioneering works due to Jackson and others (referred to in Conway et al 
1967 and Kleinrock (1975)); where priority index was an affine function of waiting time. 
Moreover, such look-back was used precisely out of concern for “the upper tail of the 
waiting time distribution” (p. 180, Conway et al 1967). In our proposed rule form, we 
seek a weighted-SPT type priority index and use exponentiation of waiting time in our 
index, with a single control parameter. Concerning the choice of a in our rule form, we 
give some exact and heuristic analysis in § § 6 and 7. For the bulk of our experimentation, 
we used a = 0.5; and the results of this, in comparison with several well-known priority 
dispatching rules are given in § 4 below. 


4. Dispatching priority rules used in the experiment 

While considering dispatching priority for an operation in an assembly job, if we keep 
in view only the (unique) directed path from the said operation to the root node of the 
concerned assembly tree, then conventional job shop (non-assembly) priority dispatching 
rules are immediately applicable (as default options). 

On the other hand, an intrinsic complication in assembly job shop scheduling exists 
as follows. In defining dispatching priority mles, it would be desirable to know to what 
extent the waiting time encountered along one path to the root node of an assembly tree 
has correlation with the waiting time encountered along other similar paths; while any 
such correlation generally cannot be independent of the priority dispatching rules used at 
various machines. In fact, reliable anticipation of waiting times concerns transient analysis 
of a (dynamic priority) queueing network, which is practically out of question. Therefore, 


Table 1. Characteristics of well-known priority rules used for comparison in experimentation. 


Rule 

Remarks 

TWK 

Consistent priority across the machines; lowers MFT & MTD; long jobs face continual 
discrimination, so long tail of tardiness distribution 

HDD 

Consistent priority across the machines; does not recognise operational lateness; not 
effective with respect to MTD, if due dates are tight 

FCFS 

Discourages job passing (overtaking), thereby avoiding high variability in flow time and 
tardiness; in a single machine static scheduling, FCFS minimises F^ax (exactly as FDD 
minimises Lmax) (a simple to derive, but not explicitly referred fact, which throws light 
on the behaviour of our rule, MODWIP, when ot goes to infinity) 

MSLK 

Slack is difference between due date and earliest finish time; jobs with lower slack get 
higher priority; protects long Jobs from excessive delays; not effective at lowering mean 
tardiness; has certain anti-SPT behaviour (Baker 1984) 

MOD 

Maximum of ODD and EFT for that operation; effective in minimising the conditional 
mean tardiness due to SPT effect, when jobs are operationally late, but long tail of tardiness 
distribution not controlled 

TWK-IR 

Refer figure 3. Priority is based on min TWK and ties broken using max importance 
ratio; tries to maintain parity in progress; given two components of an assembly with long 
strings of operations, progress through operations along one branch gets de-prioritised 
unless requisite progress occurs along the other branch (component) and vice versa; some 
instability in this local control of staging delay can affect job tardiness. 
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5 n defining a simple yet sound scheme for updating operational due dates, is not an easy 
>blem in an assembly job shop. 

[n view of the two reasons cited above, it is not surprising that generally in testing 
patching rules for assembly job shops, a majority of contending rules are from conven- 
nal job shop literature; as rather little seems to be known about robust dispatching rules 
assembly job shops. 

Given below is a list of main basic factors or statistics which are very widely used as 
ilding blocks in dispatching rules in job shops (as borne out by well-known surveys such 
Panwalkar & Iskander (1977)) and a set of rules related to each basic factor. 

Work content, SPT, LPT, MWKR, LWKR. 

Due date, EDD, ODD, MOD. 

Arrival time, FCFS, LCFS. 

Operational milestones, ODD, MOD, OSL. 

Slack, MSLK, OSL. 

Parameter ratios. CR, COVERT. 

Por comparing our proposed rule, we have used certain rules from this list and also some 
►mising mles from assembly job shop scheduling literature, e.g. Fry et al (1989,1991). 
e rule given by Adam et al (1987) was neither considered by Fry et al (1989), nor here, 
it leads to a biased account of the unfinished work along other parallel branches in a job, 
1 more so when there are many levels of assembly. Rules considered here are described 
:able A1 of the appendix and certain relevant observations are given in table 1. 


RNOP ■ RMn«lnlns numbvr of op«r«Ron« alons that branch 
till md-product level. 


RNORforO ■ 3, 

RNOR for C ■ 2, 

RNOP for B ■ 4, 

to. RNOF.MAX ■ 4 <Along B) 

IR>RNOP/RNOF_MAX 

Currently, 

IR for D ■ 3/4 ■ 0.7S 
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Figure 4. Tall product structure. 


5. Numerical data and results 

In our experimentation, we were interested, among other things, in the effect of prod¬ 
uct structure on the behaviour of different scheduling rules. Therefore, a common set of 
three product structures, representative of tall, mixed and flat structures, was used for all 
scheduling mles. These are illustrated in figures 4,5 and 6. The other data on assembly jobs 
were generated probabilistically as is usually done in related literature. Table 2 gives the 
complete scheme for the generation of random jobs. Some elaboration follows the table. 

We follow the assumption of Poisson job arrivals and exponential processing times as 
is the standard practice in literature (e.g. Baker & Kanet 1983, Baker 1984). Poisson job 
arrivals imply a memoryless arrival process so that there is no question of use of, or bias 
due to, information on past or future arrivals (in expectation) or in choice of scheduling 
decisions. Choice of exponential processing time distribution is helpful here as it exhibits 



Figure 5. Mix product structure. 


Modified-operational-due-data priority rule 


91 



Components 

Figure 6. Flat product structure 


)ng tail. The relative goodness of priority dispatching rules is not likely to be affected 
choice of processing time distribution as long as ample variability in processing times 
1 sufficient queueing exist in the system. 

rhe values of mean processing time (5 h) and mean inter-arrival rate (12 h) were fixed 
ed on an intended aggregate shop loading or utilisation around 0.6. Staging delay for 
nponents would contribute significantly to components waiting in the system, in ad- 
on to the waiting of components for a machine after they become scheduleable. It 
s found that system utilisation around 60% still caused O(IO^) components waiting 
lie system on the whole, and this level of congestion in the system was adequate for 
aging out differences among different dispatching rules without causing extremely long 
ailisation periods for the system. 

Fables 3, 4 and 5 give numerical results for the fourteen dispatching rules tested in 
pect of the six performance criteria. The details of the notations are given in tables A1 
IA2. 


►le 2. Simulation set-up. 
ameter 

Tiber of machines 
duct structures 

Tiber of components per end product 

tribution for number of operations per 
iponent [range] 

tribution for routing of a component among 
:hines [range] 

cess of job arrivals [mean inter-arrival time] 
tribution for proc. time for any opm. [mean] 
j date tightness factor 
igth of each simulation mn 
i-in period length 

irval (period) between successive instants of 
tern statistics collection 

ue of a 


Value 

20 

3 

10 

Uniform [1,5] 

Uniform [1,20] 

Poisson [12h] 
Exponential [5 h] 
6 

12000 

2000 

8h 


Remark 
Non-identical 
Refer to figures 5, 6 & 7 
Includes sub-assemblies 


Includes set-up times 
DD = aryl + proc*6 
End products 
End products 


0.5 


Refer to §§ 6 and 7. 
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liable 3. Results for tall product structure. 


Rule 

MFT 

MTD 

CMT 

NTJ% 

NINTN 

AS 

S/OPN 

458.86 

9.01 

72.86 

0.123 

332 

158 

TWK-FIFO 

467.33 

19.47 

109.74 

0.177 

464 

16C 

TWK-IR 

476.39 

21.82 

115.37 

0.189 

490 

\54 

IR-S/OPN 

463.91 

16.44 

97.02 

0.169 

381 

142 

IR-TWK 

458.09 

15.54 

99.16 

0.156 

382 

14C 

MS-IR 

456.84 

17.70 

111.08 

0.159 

491 

152 

MS-TWK 

478.16 

24.00 

123.02 

0.195 

493 

15S 

TWK 

419.56 

6.53 

83.63 

0.078 

351 

15C 

HDD 

463.4 

8.04 

68.75 

0.117 

335 

16: 

FCFS 

458.14 

6.78 

73.15 

0.092 

326 

15f 

MSLK 

447.61 

11.53 

90.54 

0.125 

357 

15S 

MODWIP 

455.85 

5.58 

64.75 

0.086 

307 

15< 

RAN 

487.11 

28.62 

130.11 

0.107 

545 

17] 

MOD 

460.03 

8.33 

78.81 

0.105 

303 

15t 


6. Regarding choice of a 

Since our dispatching priority rule form p/w“, where p is processing time and w is waitii 
time undergone, relates to the SPT/C or weighted SPT rule (Conway et al 1967; Bal< 
1974), we first explore certain implications of this analogy. 

Consider first an M/G/1 queue with static priority classes (Conway et al 1967; Jaisv 
1968; Klienrock 1975). For class 'k' jobs, assume there is a positive constant Ck represent!: 
a weighting factor for flow times of class k and pk is a constant processing time for jo 
in class fc. It is known that in this situation with static non-preemptive priorities amo 
classes based on 

^mipk/ck) < 


Table 4. Results for mixed product structure. 


Rule 

MFT 

MTD 

CMT 

NTJ% 

NINTN 

At 

S/OPN 

431.50 

24.64 

77.92 

0.316 

374 

14 

TWK-FIFO 

428.21 

36.55 

110.74 

0.330 

522 

14 

TWK-IR. 

445.17 

42.81 

114.04 

0.375 

486 

13 

IR-S/OPN 

431.72 

33.46 

97.33 

0.343 

389 

12 

IR-TWK 

439.69 

33.06 

96.13 

0.344 

416 

12 

MS-BR 

421.59 

32.59 

102.54 

0.317 

442 


MS-TWK 

438.98 

38.04 

104.02 

0.365 

430 

13 

TWK 

405.73 

21.26 

85.92 

0.247 

371 

14 

FDD 

424.88 

23.59 

74.75 

0.315 

335 

14 

FCFS 

422.34 

19.04 

71.35 

0.266 

289 

13 

MSLK 

412.70 

24.87 

89.85 

0.276 

393 

13 
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Table 5. Results for flat product structure. 


Rule 

MFT 

MTD 

CMT 

NTJ% 

NINTN 

ASD 

S/OPN 

421.55 

138.83 

166.98 

0.831 

607 

274.7 

TWK-FIFO 

441.42 

162.42 

199.70 

0.813 

661 

267.8 

TWK-IR 

414.50 

134.80 

168.60 

0.799 

564 

254.9 

IR-S/OPN 

410.01 

129.88 

161.23 

0.805 

557 

249.3 

IR-TWK 

406.89 

127.79 

161.25 

0.792 

538 

249.7 

MS-IR 

393.68 

118.91 

159.76 

0.744 

596 

248.5 

MS-TWK 

429.60 

148.76 

181.41 

0.820 

619 

263.0 

TWK 

389.17 

115.04 

156.77 

0.733 

582 

245.4 

EDD 

382.10 

106.53 

140.56 

0.757 

502 

258.1 

FCFS 

396.88 

115.61 

145.33 

0.795 

569 

257.8 

MSLK 

390.94 

113.66 

148.96 

0.763 

527 

242.9 

MODWIP 

383.83 

103.75 

135.25 

0.767 

482 

252.4 

RAN 

495.10 

198.42 

255.20 

0.853 

673 

277.3 

MOD 

386.90 

106.90 

137.57 ‘ 

0.777 

528 

249.7 


4 


and FCFS within each job class, (1) is equivalent to 

Min£ Ci I, (2) 

where the sum extends over jobs i waiting in the system (at a random point in time). 
Applying Little’s law we also know that (2) is equivalent to 


Min 


k J 


(3) 


where Wk denotes mean waiting time (in queue) for jobs in class k, since the average 
number of jobs in the machine is invariant as there is no avoidable machine idling. 

Interpreting the sum in (3) as the time rate of accumulation of system cost defined 
appropriately, it is seen that (3) is also equivalent to 


Min £{u;/? • c^?}, 


(4) 


where, R denotes the class index of a random job coming to the system and Wj? is a random 
sample of a waiting time within the jobs of class R. 

Now, let’s consider our mle form 


Min{p/u;“}. (5) 

Here, essentially the, priority ‘class’ of a job changes over time while the job waits, as it 
depends on both p as well as w (which increases). Therefore strictly speaking it is not 
obvious whether, analogous to (4), we would have 

Min£’{W/?.(W/?)“}, (6) 

if we use dispatching mle as in (5) and denote W/? a random sample of waiting times 
of jobs among class R, where the class R itself is drawn at random with probability 



94 


P G Awate and P V Saraph 


Xk/iY^jXj) of R being k. Writing the said random sample Wr simply as W, let us write 
(6) as, 

Min£:(W^^+“^). (7) 

Conjecture. In an M/G/1 queueing system with heavy congestion so that with overwhelm¬ 
ing probability, 

W » p, (8) 

the dynamic dispatching rule in (5) would nearly (asymptotically, in terms of ratio of 
objectives) achieve the objective in (7). 

The reasons why we have made a statement only in the form of a conjecture are as 
follows. 

Reason 1. The equivalence of (5) and (7) is not expected to hold if asymptotic condition 
of (8) does not hold. This is explained using proposition 1 below. 

PROPOSITION 1 

Consider a static situation where Jobs j = 1, 2,,..,« have already undergone waiting 
times Wj respectively. Suppose the objective is to 

(9) 

where, Wj denotes the total waiting time of job ‘j ’just before it gets served. If pj denotes 
processing time of job ‘j’, then priority rule in (5) asymptotically achieves objective in 
(9), if Wj » Pj for all j. 


Proof of proposition 1. The proof follows simply from the usual pair-wise interchange 
perturbation applied to a sequence of jobs. e.g. job 1 would immediately precede job 2 in 
an optimal permutation only if: 

P 2 ■ d/dx{(Wi -h ;c)(^+“)}at;c=o > PI • d/dx{(W 2 + (10) 

i.e.pi/W^<P2/W^, ( 11 ) 

which yields the proof. 

Note that in the absence of the asymptotic condition, essentially, (8) or (10) would not 
hold and nor would (11). That is why the objective in (7) cannot be expected to strictly 
follow from the rule form of (5). 

Reason 2. The linking behaviour of total differences in value of potential function for 
Markov chains, with costs in more than one dimension and with non-homogeneous in- 
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renewal process, the limiting behaviour of local differences in the renewal function de¬ 
fined on the linear state-space). In fact there exist well-known open conjectures concerning 
asymptotic behaviour in queueing systems (e.g. the conjecture of Jackson in Conway et al 
1967, p. 181) for which the standard limit theorems normally invoked in the context of 
Markov chains or Markov renewal processes do not seem to suffice. In fact the only 
known (to us) proof of optimality of the SPT/C rule in a truly dynamic scheme of pri¬ 
orities (not among classes of jobs but among individual jobs present in any given system 
state) is due to Kakalik & Little (1969) which exploits in great depth the duality between 
LP and MDP, and no comparable studies for more general objective function seem to be 
known. 

It may be mentioned here that as a goes to oo, what proposition 1 tells us, in the case 
where p <^W, stands in essential agreement with the fact seen earlier that FCFS minimises 
Fm&X‘ This is because in that case minimising Wmax is practically as good as minimising 
J^max froni the user’s point of view. 


Note. In the study of dynamic dispatching rules and their optimisation at least in an 
approximate sense, the use of Little’s law is found to shed some light on the nature of 
optimisation as seen from a queueing control theoretic point of view. This has also been 
seen, in the context of another type of problem, by Awate & Neeraj Kumar (1992). 

7. Empirical guideline regarding choice of a 

During early stages of our experimentation, and before our analysis given in § 6, we had 
considered a as powers of 2 [0.5, 1, 2, 4, 8, 16]. We found values of a greater than one 
to be not promising enough to warrant further consideration. Here we suggest a heuristic 
explanation for this. 

From a practical point of view, we seek reasonably low mean flow time, mean tardiness, 
conditional mean tardiness and a small value for (say) 99 percentile point of tardiness dis¬ 
tribution (to indicate avoidance of a long tail). Also, rather than seeking to control the 99 
percentile point, it could suffice to ‘keep in control’ the mean and variability of tardiness. 
Although variance of tardiness is not directly captured by any (non-central) moment of 
tardiness distribution, keeping in control the first two (first and second) moments of tardi¬ 
ness would be a reasonable objective. Now applying the heuristic arguments concerning 
our conjectured relation between mle in (5) and effect in (7), we are led to the choices of 
(1 4- a) as 1 and 2; i.e. a = 0 and a = 1. 

In light of the facts and analysis concerning our objectives, we are led to a guideline for 
range of a between 0 and 1. 


Table 6. Results for tall product structure - Variation in a. 


a 

MFT 

MTD 

CMT 

NTJ% 

NINTN 

ASD 

0.00 

468.46 

10.29 

76.89 

0.133 

390 

166.8 

0.50 

455.85 

5.58 

64.75 

0.086 

307 

156.8 

1.00 

468.20 

10.06 

81.68 

0.123 

394 

164.9 
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Table 7. Results for mixed product structure - Variation in a. 


a 

MFT 

MTD 

CMT 

NTJ% 

NINTN 

ASD 

0.00 

413.13 

19.84 

75.56 


291 

142.1 

0.50 

415.82 

15.80 

65.17 


274 

139.1 

1.00 

424.11 

21.66 

71.56 


291 



Our current experimentation tests values of a as 0, 0.5 and 1. Among these, a = 0.5 
performed better. We did not seek a fine tuning of value of a because we expect, based 
on the analysis above and numerical results, that the value of a recommended should be a 
simple or robust one. As noted by Baker (1974), the SPT effect itself often has unexpected 
side benefits, such as considerably reducing the mean of lateness, so that the concern for 
a = 0 case remains strong. On the other hand choosing a = 1 corresponds to a quadratic 
cost of waiting in (7). Thus, a = 0.5 seems a simple and robust compromise. Results for 
different values of a are given in tables 6,7 and 8. 


8. Conclusions 

Use of job waiting times at a machine in the dispatching priority index for operationally 
late jobs has been found to provide improvement in terms of tardiness measures: mean 
tardiness, conditional mean tardiness and 99 percentile point ^ of tardiness distribution. 
In case of flat product structure, we got more significant improvement in 99 percentile 
point of tardiness distribution; here the fraction of jobs tardy was high (more than 75%)*. 
Here, jobs in the upper tail of tardiness distribution were in relatively large numbers, and 
could be expected to be among operationally late jobs and consequently influenced by our 
rule’s discrimination based on waiting times. On the other hand, in case of tall product 
structures, the fraction of tardy jobs was small (of the order of 10%). Correspondingly here, 
our rule, based on discrimination among operationally late jobs, got invoked less often. 
The improvement we got was relatively more significant in mean tardiness indicating that 
some operationally late jobs, probably upstream, were accelerated by our rule and thereby 
prevented from being tardy at the end. In case of mixed product structure, we got significant 
improvement in mean tardiness, conditional mean tardiness as well as 99 percentile point 
of tardiness distribution. 


Table 8. Results for flat product structure - Variation in a. 


a 

MFT 

MTD 

CMT 

NTJ% 

NINTN 

ASD 

0.00 

390.73 

112.67 


0.799 

560 

256.3 

0.50 

383.83 

103.75 

135.25 

0.767 

482 

252.4 

1.00 

392.68 

109.34 


0.785 

544 

255.1 


^ For the allowance setting mechanism, namely, six times the processing time does not provide for extra allowance 
in case of flat structure which is exposed to higher staging delay; which leads to higher fraction of tardy jobs as 






As we had indicated earlier in the paper and in conformity with some of the earlier 
works cited from the literature, the SPT effect seems to play a crucial role in tall product 
structure and indeed we found the MOD rule to be a veiy strong competitor, especially in 
tall product structure. Our current results indicate the MOD rule giving a slightly better 
value for the 99 percentile point of the tardiness distribution (but not for MTD and CMT). 
In our further experimentation, involving variation of more number of factors, we intend 
to take a closer look at the cited phenomenon in tall product structures. 

Regarding choice of parameter ‘o;’ in the experimentation of waiting time in our pri¬ 
ority index, our results indicate that ot = 0.5 is better than a = 0 (equivalent to SPT 
among operationally late jobs) and a = 1 (equivalent to FCFS among operationally 
late jobs). Based on certain analytical relations between queueing control problems and 
priority dispatching problems, we have indicated earlier the desirability of a lying be¬ 
tween 0 and 1; reflecting concern for the mean and the second moment of tardiness. 
Based on our current findings, a. = 0.5 is consistently better at tardiness measures as 
seen in tables 6, 7 and 8. Further study of the role of value of a will be part of future 
research. 

Our new proposed rule, developed in the context of assembly job shop scheduling, gives 
an effective and efficient extension of the well-known MOD rule defined in the literature 
for job shop scheduling with due dates. On the other hand, we have derived a practically 
very interesting interpolation between SPT and FCFS dispatching rules and delineated its 
relation to a rich analogy with a queueing control problem. This also seems to warrant 
further study from the point of view of job shop scheduling in general. 

Our findings are also essentially in agreement with some previous ones in literature 
that the most complicated rules in job shop scheduling are hardly the most effective 
ones and simple-looking rules based on sound analysis do have scope for furthering 
improvements. 


We thank two anonymous referees for their comments and thoughtful observations which 
helped to improve the clarity at certain points in our paper. One of the referees has also 
pointed out to us a very fine paper “Efficient scheduling policies to reduce mean and 
variance of cycle time in semiconductor manufacturing plants” by Lu, Ramaswamy and 
Kumar in IEEE Trans. Semiconductor Manufacturing (1994, p, 374). While this paper 
shares the concern for variance of cycle times as we do, the problem environments and 
the approaches in their paper and ours are really quite different and therefore we have not 
formally included this in our actual reference list. 
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Appendix A. 


Table Al. Rules used in simulation. 



Explanation 

Formula 

Name 

Primary rule 

Tie-breaker 

Primary rule 

Tie-breaker 

S/OPN 

Slack per 

— 

Min/I(i/; • EFT;)/ 

- 


operation 


RNOP/] 


TWK-FIFO 

Total work 

First in, 

Max[S r, ]: i = current 

Min;[r;] 


content 

first out 

opm * to end opm 


TWK-IR 

Total work 

Importance 

Max[E f;]: i = current 

Refer to figure 3 


content 

ratio 

opm to end opm 


IR-S/OPN 

Importance 

Slack per 

Refer to figure 3 

Min;[(£/; - a)/ 


ratio 

operation 


RNOP;] 

IR-TWK 

Importance 

Total work 

Refer to figure 3 

Max[i; /;] : i = current 


ratio 

content 


opm to root opm 

MS-IR 

Modified 

Total work 

Max. IR, ifSL<0 

Max[S ti] : i = current 


slack 

content 

Min. SL, o/w 

opm to root opm 

MS-TWK 

Modified 

Importance 

Min. TWK, ifSL> 0 

Refer to figure 3 


slack 

ratio 

Min. SL, o/w 


TWK 

Total work 
content 

- 

Min;(r;) 

— 

EDD 

Earliest 
due date 

- 

Min; (if;) 

- 

FCFS 

First come 
first served 

- 

Min;(r;) 

- 

MSLK 

Min. slack 

- 

Min;(rf; ■ EFT;) 

- 

MODWIP 

Refer to § 2 

- 

Refer to figure 2 

- 

RAN 

Random 

- 

1/(0,100) 

- 

MOD 

Modified 

- 

Max[ODD, (Tiiow “b 

- 


operational 
due date 





* Opra - Operation. 


Table A2. Symbols and explanations. 


Symbol 

Explanation 

Formula 

i 

Operation 

- 

j 

Job 

— 

k 

Flow allowance factor 

6 

n 

Number of jobs (end products) 

- 

a 

Control parameter for MODWP 

0.5 

9 

Total processing time of job j 

E tij over all i for each j 

dj 

Due date of job j 

Vj + k*tj 

0 - 

Arrival time of job j 

- 

EFT, 

Earliest finish time of job j 

— 

O' 

Completion time of job j 

- 


Processing time of opm i of job j 

- 

Tnow 

Current time 

— 

RNOP; 

Remaining number of opras of job j 

- 




Symbol 

Explanation 

Formula 

ODD,,j 

Operational due date of opm i of job j 

ODD/_i,y + - rj) where, 

ODDo.y = rj 

ATi.y 

Arrival time of opm i of job J 

- 

SL,-,; 

Modified slack for opm i of job j 

dj — Tnow ~ S tij (over operations 
following i, j) 

MFT 

Mean flow time 

{E (cj - rj)]/n 

Tj 

Tardiness of job j 

max (lateness, 0) for job j 

MTD 

Mean tardiness 

{E iTj)}/n 

CMT 

Conditional mean tardiness 

{S iTj)]/nr 

NTJ 

Percentage of number of tardy jobs 

u-t/n [«/: all jobs such that cj > dj] 

NINTH 

99% point of tardiness distribution 

Refer to figure Al 

ASD 

Average staging delay 

Refer to figure 1 



Figure Al. 99% Point of tardiness 
distribution statistics. 
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Abstract. The increasing worldwide competition requires economical manu¬ 
facture, high quality and short delivery time. The Just-In-Time (JIT) philosophy 
of manufacturing is increasingly being considered by manufacturing organiza¬ 
tions, as a response to the increased pressure to supply high quality products 
with short delivery times and at low cost. A very simple shop floor control sys¬ 
tem that was developed by Toyota in the 1970s specifically for their Just-In-Time 
assembly plant, has received considerable attention in the Western World, and 
is known as the ‘kanban’ system (literally translated as ‘card’ system). Japan’s 
success has prompted many scholars and practitioners, to turn their attention to 
Japanese management practices. This article is to address the modelling, sim¬ 
ulation and implementation issues of Just-In-Time in flexible manufacturing 
environments. Priority nets are used for modelling and analysis of the kanban 
system. A large number of simulation runs are conducted/presented to probe 
the behaviour of the system, with respect to different parameter changes. 

Keywords. Just-In-Time; flexible manufacturing systems; kanban. 


Introduction 

St-In-Time (JIT) is considered to be one of the major factors contributing to the success 
Japanese industries (Sugimori etal 1977; Monden 1981,1983; Schonberger 1982). The 
r approach to production was originated by Toyota in 1970s in their car assembly plants 
Japan, and has found most success in repetitive manufacturing environments. The JIT 
lilosophy has attracted increased attention from researchers who have tried to explain JIT 
iplementation successes and failures outside Japan (Narendar et al 1995). The objectives 
the JIT system are to maximize the velocity of material flow through the plant, by 
ranging materials to arrive at each stage of manufacture just in time, to be processed and 
ove on to the next stage. JIT manufacture is a well-proven manufacturing management 
lilosophy which aims at eliminating the Work-In-Process (WIP) and optimizes material 



102 


K Ravi Raju et al 


delivery timings. Traditionally it is applied in the mass manufacturing environment. In 
recent years, it also being implemented in flexible manufacturing environment. A survey by 
Kim & Schniederjans (1990) which includes 450 US manufacturing firms emphasizes that 
a computer-integrated JIT system is more productive, cost efficient and highly effective in 
producing a quality product than a JIT or CM system alone. To date, there is a lot of reported 
research in the areas of JIT and flexible manufacturing individually. However, there is no 
detailed investigation of implementing JIT in flexible manufacturing environments. Hence, 
to realize the full benefits of JIT in flexible manufacturing, many implementation issues 
have to be investigated. 

Though, JIT offers many advantages such as WIP reduction, increased flexibility and 
quality, reduced space requirements etc., its implementation is not always a success. Indeed, 
on the one hand it necessitates a specific consumption of finished products that is regular 
enough, while on the other, it necessitates major changes in manufacturing process such 
as reducing setup times, increasing reliability, improving product quality and reducing 
customer response time (Schonberger 1982; White 1993). Thus these problems justify the 
interest in methods to evaluate the performance of these systems. 

Kanban system is the backbone of JIT philosophy. The philosophy behind the kanban 
approach is, the work should never be pushed on to the next work centre until that work 
centre is ready for it. This utilizes cards to authorize production and move material between 
work centres. In the literature, kanban systems have been studied using a variety of methods. 
Some researchers have analyzed JIT systems using mathematical programming (Bitran & 
Chang 1987; Mitra & Mitrani 1988; Wang & Wang 1991). Simulation is frequently used 
to evaluate JIT systems (Huang et al 1983; Schroer et al 1985; Lulu & Black 1987). 
System dynamics models are also used to study the kanban systems (Gupta & Gupta 
1989). Mascolo et al (1991) have demonstrated the suitability of Petri Nets for unified 
modelling of kanban systems. 

The objective of this paper is to establish priority nets (PRNs) as powerful and flexible 
tools for modelling and analysis of JIT manufacturing systems. It also details tlie differences 
between the implementation issues of conventional manufacturing systems (CMSs) and 
FMSs, and addresses some of the issues with petri net-based simulation. 


2. JIT in flexible manufacturing environment 

The success of JIT mainly depends on the nature of manufacturing environment. It is 
found to be more suitable for a repetitive manufacturing environment, where products are 
regularly ordered. The suitability of JIT for manufacturing environment is summarized 
in figure 1 (Voss & Harrison 1987). Systems with mid volume and mid variety (flexi¬ 
ble manufacturing environment) almost certainly have part or all of their manufacturing 
suitable for JIT, those at the top left and bottom right are suitable for selected applica¬ 
tions. 

The second criterion for determining the suitability of JIT is based on the complexity of 
oroduct structure and nrocess rnuHn<y fioiirf*. 7 /'Vncc Wnrricnn 1 
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VARIETY 

HIGH 


MEDIUM 


LOW 


LOW MEDIUM HIGH 

VOLUME 

Figure 1. JIT and the manufacturing environment (Voss & Harrison 1987). 

1 JITinFMSs 

r systems require production processes which can economically produce small runs of 
oducts. This flexibility is generally required to support the JIT concept. Economical 
nail production runs are achieved by reducing the equipment setup time which is the 
ted cost component associated with each production run. With the advent of FMS this 
quirement has become a reality. FMSs have the ability to produce a wide range of parts 
lickly in small lots, and thus fit well into the JIT approach. In addition, it can be stated that 
ithout some form of FMS or flexible manufacturing facility, it is difficult to implement 
T effectively within a batch manufacturing environment. 

2 JIT inflexible assembly systems 

exibility in assembly is a fast-growing prerequisite imposed by present day industry, 
le to the ever-increasing variety of models, and to quickly respond to market changes, 
le advent of programmable industrial robots spurred the development of FASs. JIT is 
iccessfully implemented in many FASs. For example, an FAS is installed at International 
omputers Ltd. (ICL, England) integrating assembly, testing and philosophy. This system 


JOBBING/PROJECT 
Part of JIT 

unsuitable, application of 
selected techniques 


BATCH MANUFACTURE 

JIT suitable for 
part of plant 


JIT suitable for 
entire plant 


LINE MANUFACTURE 

JIT suitoble for 
Manufacture 


PROCESS MANUFACTURE 

Much of JIT in place, 
opplicotion of 
selected techniques 
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Complex 

PERT 

structure 

1 


1 

MRP 1 


1 I 

I I 

JIT-l-MRP I I 

I I 


I I 


JIT I I I 

Simple 

I i I 

structure 

I I I 

I I I 

- ^ ^_I 


Simple routings Complex routings 


MRP : Manufacturing Resource Planning 

PERT : Programme Evaluation and Review Technique 

Figure 2. Suitability of HT for controlling production (Voss & Harrison 1987). 

help of central software and coded pallets (Talavase & Hannam 1988). Pye Telecommu¬ 
nications Ltd., successfully implemented FAS with JIT, with a 96% reduction in lead time 
and WIP (Wilson 1986). 

2.3 JIT in job shop environment 

JIT approach is traditionally thought to be not suitable for small industries operating as 
job shops. However, in a publication (Gravel & Price 1988), it is reported that Kanban 
method can be successfully adapted to the job shop environment. This is tested in a firm, 
manufacturing high quality outdoor clothing and the results of the pilot mn are found to 
be encouraging. However, the authors felt that some of the modelling and practical issues 
need to be resolved before successful implementation. 


3. JIT implementation issues 

The implementation issues of JIT in FMSs are different from those in CMSs, since (i) the 
configurations, (ii) the main criteria in each of the cases are different. In the environment 
of FMS/FAS, the multi-functional workers are replaced by robots, conventional machines 
are replaced by CNC machining centres and moving carts by AGVs. In CMSs, the main 
criteria is to maintain the lowest possible inventories even though the utilization of the 
some of the machines is reduced. But in FMSs, the utilization of machining centres must 
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high to justify this investment. Therefore, for implementing JIT in FMSs a balance has to 
achieved between inventory levels and utilization levels. Also the degree of automation 
j to be kept low to reduce the production cost. Masuyama (1986), has discussed some 
the implementation issues in FMS environment. 

The prominent issues for the successful implementation of JIT in either CMSs or FMSs 
:: (1) reduction in setup times, (2) reduction in conveyance times between machines, 
i (3) number of kanbans and kanban sizes. However, there are additional issues to be 
dressed in the context of FMSs due to the fact that (a) multiple-part types are processed 
aultaneously, and (b) the systems have to respond to the changes in part types and 
lumes. The additional issues are: 

I kanban related issues in multi product environment, 

I part routings and layout, and 

i AGV related issues. 


[ Reduction of setup times 

Dnden (1981,1983) separates setup times in CMSs into (i) internal (actions that require 
j machine to be stopped) and (ii) external (actions that can take place when the machine 
mnning) setup times. Efforts are then made to convert maximum possible amount of 
emal setup time to external in order to reduce the overall setup time. Internal setup time 
FMSs is almost negligible due to flexibility of CNC machining centres with automated 
3l-handling facilities. 



-Control Point M - Machine C - Charging station 
Figure 3. Layout of the system. 
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Table 1. 

Part routings and processing times. 


Part type 

Routing 

Processing times 

PRl 

LUS-M1-M2/M3-M4-LUS 

30-150-240/290-230-30 

PR2 

LUS-M4/M5-M1-M2/M3/LUS 

30-205/250-170-205/290-30 

PR3 

LUS-M2/M3-M5-M1/M4-LUS 

30-220/240-275-290/240-30 


3.2 Reduction of conveyance times 

Conveyance times in CMSs are reduced by belt conveyors, chutes or fork lifts. As these 
types are limited in flexibility for transferring parts/materials, they cannot be used in the 
environment of FMSs as asynchronous flow of material is mandatory. Owing to this AGVs 
are generally preferred in FMSs. 

3.3 Reduction of number of kanbans and lot sizes 

The inventory in a system is tightly controlled with the help of kanbans. A kanban system ^ 
lowers WIP levels by decreasing either the number of kanbans or the kanban size (lot 
size). Reducing production lead time is very crucial for the operation of JIT systems with 
varying demand, without high levels of either safety stocks or stock-outs. Recent literature 
indicates that there is a significant relation between batch size and production lead time. 
Hence, detennination of the optimum number of kanbans and lot sizes is very important 
in both CMSs and FMSs. i 

3.4 Kanban-related issues in FMS environment 

The following issues are important in the context of FMSs. 

3.4a Pallet as a container: In CMSs, special containers are used for each part 
type. In FMSs, parts are generally loaded onto pallets and routed through the machines. 

Pallets form the interface between the workpiece and the machines. Thus the pallets act 
as containers in this case. These pallets are generally coded and contain necessary infor¬ 
mation. 

FMSs are capable of processing families composed of a large number of part types. 

Two different kinds of families (closed and open family) can be taken into account. In 
the first case, the mix of the part types is predetermined and the system is setup for long 
time production of some part types. Under these conditions, dedicated pallets with special 
fixtures are used for speeding up loading and unloading operations. When open families are 
considered, the system must quickly be able to fit every requirement. In this case adaptable 
pallets equipped with all-purpose clamping devices are more suitable. These pallets can 


Table 2. AGV travel times. 


Segment number 

12 345 67 89 10 11 12 

Travel time (s) 

10 10 10 10 20 10 10 10 10 20 15 15 
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:eive more than one component. In one publication (Gentili 1991), it is reported that 
4S equipped with multi-piece adaptable pallets is able to work satisfactorily in JIT 
vironment. 


lb Number of kanbans: If there are more part types in the system at a time, the 
:al inventory in the system will be very high. This is because the inventories have to 
maintained at all stages for all part types for implementing the pull mechanism. This 
3blem can be overcome by freezing schedules for a small period of time (a week or 
days), which helps in keeping the variables low, and also continuing the same part-mix 
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Part type changes - Whenever a new part type enters the system, its routing and the 
number of kanbans have to be carefully determined. Then the ‘start up’ for the part type (i.e. 
generation of kanbans on the computers, passing the kanban information to the respective 
machining centres etc.) has to be planned. Similarly, whenever a part type goes out of 
the system, its ‘cut off’ has to be planned to gradually remove the inventories (kanbans) 
associated with the part type. 

Volume changes - Even though the part mix is stable over a period of time, individual 












Figure 6. Priority net model for part type 3. 


3.5 Part routings and layout 

FMSs offer flexible routing to the parts. However, simpler routings are advantageous in 
implementation of JIT. Adoption of Virtual Cell Manufacturing (VCM) principles and 
layouts is helpful for implementing JIT in flexible manufacturing environments. VCM is 
a new concept in small batch manufacturing (Drolet et al 1990). A VCM system does not 
promote opportunistic re-routing of each part after each operation like FMS. Instead, the 
route of a job (or a set of similar parts) is defined at a virtual cell. Whenever a new part 
type enters the system a new virtual cell is created. Drolet et al (1991), demonstrated that 
‘the VCM layout concept will function in harmony with JIT’. 
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Table 3. Interpretation of places. 


Place 

Interpretation 

Place 

Interpretation 

1 

Ml free 

54 

PR2 in M3’s input buffer 

2 

M2 free 

55 

PR2 in M2’s output buffer 

3 

M3 free 

56 

PR2 in M3’s output buffer 

4 

M4 free 

57 

PR2 at LUS after processing 

5 

M5 free 

58 

Finished part of PR2 available at LUS 

6 

LUS free 

59 

Counter for PR2 

7 

AGV available 

60 

Demand for PR2 

8-13 

C-kanban post PRl in kanban post 

61 

Place for modelling different 

14-20 

C-kanban post for PR2 in kanban post 


demand patterns of PR2 

21-27 

C-kanban post for PR3 in kanban post 

62 

Fixtures for PR3 available 

28 

Fixtures for PRl available 

63 

Raw material for PR3 available at LUS 

29 

Raw material for PRl available 

64 

Loaded pallet for PR3 available at LUS 

30 

Loaded pallet for PRl available at LUS 

65 

PR3 in M2’s input buffer 

31 

PRl in Ml’s input buffer 

66 

PR3 in M3’s input buffer 

32 

PRl in Ml’s output buffer 

67 

PR3 in M2’s output buffer 

33 

PRl in M2’s input buffer 

68 

PR3 in M3’s output buffer 

34 

PRl in M3’s input buffer 

69 

PR3 in M5’s input buffer 

35 

PRl in M2’s output buffer 

70 

PR3 in M5’s output buffer 

36 

PRl in M3’s output buffer 

71 

PR3 in Ml’s input buffer 

37 

PRl in M4’s input buffer 

72 

PR3 in M4’s input buffer 

38 

PRl in M4’s output buffer 

73 

PR3 in Ml’s output buffer 

39 

PRl at LUS after processing 

74 

PR3 in M4’s output buffer 

40 

Finished part of PRl available at LUS 

75 

PR3 at LUS after processing 

41 

Counter for PRl 

76 

Finished part of PR3 available at LUS 

42 

Demand for PRl 

77 

Counter for PR3 

51 

PR2 in Ml’s input buffer 

78 

Demand for PR3 

52 

PR2 in Ml’s output buffer 

79 

Place for modeling different 

53 

PR2 in M2’s input buffer 


demand patterns of PR3 


3.6 AGVrelated issues 

These are mainly, (i) the determination of number of AGVs, and (ii) dispatching. Some of 
the traditional dispatching rules used in push systems may not be suitable in JIT environ¬ 
ments. In the present work, some of the implementation issues such as number of kanbans, 
kanban sizes and vehicle dispatching are addressed with the help of Priority nets. 


4. Modelling and simulation of FMS 

Modelling and analysis of FMSs implementing a pull mechanism are highly complex due 
to the multiple part types and AGV-based transportation environment. Priority nets can be 
effectively used to address these issues. To demonstrate the efficacy of PRNs for modelling 
pull mechanism in FMSs, a system with the following features is considered. 

• The layout of the system is shown in figure 3. The system consists of five machining 
centres and a load-unload station connected by AGV network. It caters to a variety of part 
types. Three part types are considered for the present study. The machining sequences and 
processing times are indicated in table 1. 
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Table 4. Interpretation of transitions. 


Transition 

Interpretation 

Transition 

Interpretation 

1 

Ml processing PRl 

25 

Transportation of PRl, M3 to M4 

2 

Ml processing PR2 

26 

Transportation of PRl, M4 to LUS 

3 

Ml processing PR3 

27 

Transportation of PR2, LUS to M4 

4 

M2 processing PRl 

28 

Transportation of PR2, LUS to M5 

5 

M2 processing PR2 

29 

Transportation of PR2, M4 to Ml 

6 

M2 processing PR3 

30 

Transportation of PR2, M5 to M1 

7 

M3 processing PRl 

31 

Transportation of PR2, Ml to M2 

8 

M3 processing PR2 

32 

Transportation of PR2, Ml to M3 

9 

M3 processing PR3 

33 

Transportation of PR2, M2 to LUS 

10 

M4 processing PRl 

34 

Transportation of PR2, M3 to LUS 

11 

M4 processing PR2 

35 

Transportation of PR3, LUS to M2 

12 

M4 processing PR3 

36 

Transportation of PR3, LUS to M3 

13 

M5 processing PR2 

37 

Transportation of PR3, M3 to M5 

14 

M5 processing PR3 

38 

Transportation of PR3, M3 to M5 

15 

Loading PRl on pallet 

39 

Transportation of PR3, M5 to Ml 

16 

Unloading PRl from the pallet 

40 

Transportation of PR3, M5 to M4 

17 

Loading PR2 on pallet 

41 

Transportation of PR3, Ml to LUS 

18 

Unloading PR2 from the pallet 

42 

Transportation of PR3, M4 to LUS 

19 

Loading PR3 on pallet 

43 

Consumption of finished part of PRl 

20 

Unloading PR3 from the pallet 

44 

Consumption of finished part of PR2 

21 

Transportation of PRl, LUS to Ml 

45 

Consumption of finished part of PR3 

22 

Transportation of PRl, Ml to M2 

46 

Arrival of demand for PRl 

23 

Transportation of PRl, Ml to M3 

47 

Arrival of demand for PR2 

24 

Transportation of PRl, M2 to M4 

48 

Arrival of demand for PR3 


• The AGV track layout consists of both uni-directional and bi-directional segments. 
Control points are provided on the layout to avoid vehicle collisions. Bi-directional AGVs 
are used in the system. AGVs, when unassigned, wait in the charging stations provided on 
the track. 

• The system uses only conveyance kanbans (C-kanbans) for implementing pull mech¬ 
anism. 

The following assumptions are made: 

(1) An AGV takes 30 seconds for loading/unloading and 5 seconds to cross a control point. 
The time required to cover different segments is given in table 2. 

(2) Raw materials are available in perennial supply. 

4.1 Modelling 

A hierarchical modelling methodology is adopted. At the higher level the pull mechanism 
is modelled. This is referred to as system net. At the lower level the AGVS is modelled, 
which is referred to as logistic net. For the system net a part variety based approach is 
adopted. The PRN model of pull mechanism of each part type is made separately. These 
PRNM are then linked by merging the common D-places to yield the system net. The 
PRNMs of each part type is shown in figures 4 to 6. The interpretations of places and 
transitions are given in tables 3 and 4 respectively. The logistic net of the system is shown 
in figure 7, details are given in Ravi Raju & Chetty (1993). 
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Figure 7. Logistic net of the system. 


4.2 Simulation 

The system is simulated by the concurrent execution of the system net and logistic net. 
Control strategies are implemented with the help of D-places. D-places associated with 
machines resolve conflicts among transitions representing processing activities. D-places 
associated with AGVs resolve conflicts among transitions representing transportation 
activities. 

AGV movements are simulated by the logistic net, figure 7. Whenever an unload tran¬ 
sition in the logistic net fires, it changes the attribute AGV engaged to zero, and thus 
terminates the firing of a transition representing a transportation activity in the system 
net. An Ada-based software package JIT-SIM is developed for the simulation of JIT sys¬ 
tems. The basic inputs to the package include system net details and AGVS details. The 
interactive input to the package includes a number of AGVs and their locations, number 
of kanbans at each stage, kanban sizes, variability in the processing times, job selection 
mles, machine selection rules, and simulation time. In addition to the usual information, 
the JIT-SIM gives the important performance measures relevant to the JIT environment, 
such as percentage of back-ordered demand, mean tardiness, average in-process inventory, 
cumulative machine idle time, and overtime. 


Table 5. Inter-arrival times. _ 

Part type Mean arrival time _ Inter-arrival times (min) 

PRJ io 0 22 18 1 4 22 2 0 17 4 8 8 18 2 9 7 1 0 24 1 13 3 2 5 6 16 24 14 5 

12 0 17 5 24 1 5 18 0 0 16 2 10 18 27 0 3 9 5 16 0 12 4 5 9 
PR2 12 92 110939 15 11 28 12 163 1 18 106227921 232540 030 

40 12 7 13 16 00 54 35 

PR3 15 6 55321 141 110 16 130541095 1 1 321 52 26 6 7 41 1526 
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5. Analysis 

The following design issues are addressed in the present study: 

• Number of kanbans 

• Kanban size 

• Vehicle dispatching 

Being a demand-driven system, the functioning of a JIT systern starts with the arrival 
of a demand. A Poisson arrival pattern with a different mean arrival time for each part 
variety is considered in the present study. Inter-arrival times for each part type are shown 
in table 5. The same inter-arrival times are used in all simulation experiments in order to 
investigate the effects of different variables on system performance. In the initial analysis 
phase, several simulation runs are conducted to probe into the behaviour of the system. 
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(1) Base run: The purpose of the base run is to examine the model under typical condi¬ 
tions wliich will then serve as the basis for comparison in other experiments. The following 
parameter settings are used for this experiment. 

(i) Number ofAGVs = 3. 

(ii) Number of kanbans = 2. 

(iii) Kanban size = 2. 

(iv) No variation in processing times. 

In all the other runs that follow, the parameters are compared with those of the base run. 

(2) Run one (less kanban size): The purpose of this run is to evaluate the effect of reduced 
lot size. The same settings are maintained as in base run except that the size of the kanban 
is reduced to one. 

(3) Run two (less kanbans): In this experiment the effect of number of kanbans is inves¬ 
tigated. With the original value of kanban size the number of kanbans is reduced to one. 
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The intent of this test is to provide a contrast to the previous experiment (run one) and 
evaluate both the methods of reducing work in process inventory. 


(4) Run three (demand shock): The purpose of this study is to evaluate the sensitivity of 
the system to demand shock. A demand shock (6 products of each part variety) is introduced 
at the beginning of the second hour. A JIT system, to be efficient, should support significant 
demand shocks for a short period of time. 


(5) Run four (Cv = 1.5); The purpose of this run is to evaluate the effects of variability 
in processing times. All parameters of this run are same as that of the base run, except that 
there is 50% variability in processing times. 


(6) Run five (ideal JIT): In this final experiment the system is evaluated with the smallest 
possible work-in-process level. This is accomplished with the following: 
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All the above simulation runs are compared with the help of two important performance 
measures namely in-process inventory and cumulative machine idle time. Figure 8 shows 
the comparison with respect to in-process inventory. Experiments with low kanbans and 
a kanban size of two (base run, run 3, run 4) have higher inventory levels. Runs with 
smaller kanban size and fewer kanbans (runs 1 and 2) have reduced inventory levels. Run 
5 (ideal JIT) has minimum inventory and is as expected. An important feature of inventory 
can be observed in run 3, where there is a rapid decline of inventory during the second 
hour. This is due to the demand shock which comes into effect during this period and 
depletes the inventory. The comparison of runs 1 and 2 suggests that it is more effective 
to have a large number of kanbans with small size instead of a fewer kanbans with large 
capacity. 

The other comparison is made with respect to cumulative machine idle time, figure 9. 
It is found to increase as the inventory reduces. The idle for the third experiment (demand 
shock) is minimum. Also, idle time remains unchanged during second and third hours. 
This is due to the fact that machines are kept busy during this period due to demand shock. 
It is found that idle time increases steeply in all other cases. This may be due to the fact 
that machines are kept busy during this period due to demand shock. It is found that idle 
time increases steeply in all other cases. This may be due to lower consumption rates and 
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reduced number of AGVs. Another important finding is that the variability in processing 
times (run 4) results in increases in machine idle time. 


Number of kanbans and kanban size: The three parameters considered for this analysis 
are percentage of back-ordered demand, mean tardiness and average in-process inventory. 
Thr ee-dimensional plots in figures 10 to 12 show the effects of the number of kanbans and 
kanban size. As the number of kanbans increases, the percentage of back-ordered demand 
falls sharply at the beginning and reaches a minimum value. The same trend is observed 
with respect to kanban size. 

Figure 11 shows the variation of mean tardiness. This varies in a similar fashion as in 
the case of back-ordered demand. Figure 12 shows that the in-process inventory increases 
sharply with increase in the number of kanbans and kanban size. Therefore, considering the 
in-process inventory, mean tardiness, mean back-ordered demand, two kanbans at every 


Table 6. Effect of number of kanbans and kanban size. 


Number of 

kanbans 

Kanban 

size 

PRl 

Day’s schedule 

PR2 

PR3 

Completion 
time (s) 

AMir 

AVU 

2 

1 

54 

37 

36 

23545 

74.6 

93.0 

1 

2 

54 

37 

36 

22953 

76.2 

51.9 

2 

1 

40 

40 

40 

21951 

76.5 

93.4 

1 

2 

40 

40 

40 

20724 

83.2 

57.4 


AMU - Average Machine Utilization; AVU - Average Vehicle Utilization 
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Table 7. Effect of redundant work dispatching rules. 


Redundant work 
dispatching rule 

Cumulative machine 
idle time (min) 

Back ordered 
demand (%) 

Mean tardiness 
(s) 

Average vehicle 
utilization {%) 

ED 

111.98 

69.29 

828.6 

83.36 

STT 

105.98 

66.92 

463.3 

80.15 

MSPT 

110.95 

70.86 

795.3 

81.50 

RJ 

111.80 

70.07 

828.7 

82.87 


Job selection rule: SPT; Machine selection rule: LUM; Redundant vehicle dispatching rule: LUV 


Stage for each part variety, with size of kanban being ‘1’, are found suitable for the system 
under study. 

The analysis of the same system working to a daily schedule (single kanban) is also 
considered. In this case, the system acts as a push system for production and pull system 
for replenishments. Some typical results are shown in table 6. A kanban size of two with 
a single kanban at each stage is found to yield better results in this case. 

Vehicle dispatching: The following dispatching rules are considered for the present study. 
Redundant work dispatching rules: 

(1) Earliest Demand (ED); 

(2) Shortest Travel Time (STT); 

(3) Minimum work in queue (MWQ); 

(4) Random Job (RJ). 

Redundant Vehicle dispatching rules: 

(1) Least Utilized Vehicle (LUV); 

(2) Nearest Idle Vehicle (NIV); 

(3) Longest Idle Vehicle (LIV); . ^ 

(4) Random Vehicle (RV). 

Table 7 shows the effect of redundant work dispatching mles. STT is found to give 
better results with respect to vehicle utilization and other performance measures noted in : 
this table. Table 8 shows the effect of redundant vehicle dispatching mles. LIV is found 
to perform well with respect to tardiness and back ordered demand. NIV is found to give 
better result with respect to minimum vehicle movement. 


Table 8. Effect of redundant vehicle dispatching rules. 


Redundant vehicle 
dispatching rule 

Cumulative 
machine idle time 
(min) 

Back-ordered 
demand (%) 

Mean tardiness 
(s) 

Average vehicle 
utilization (%) 

LUV 

105.98 

66.92 

463.3 


NIV 


67.17 



LIV 


64.57 

354.8 


RV 


67.71 

675.5 

79.48 








6. Conclusion 


In this paper, an attempt is made to address the modelling, simulation and implementation 
issues of JIT in flexible manufacturing environment. Priority nets are proved to be suitable 
and competent tools for the modelling and analysis of kanban systems. Realizing the 
need for a thorough investigation of JIT in FMSs, an FMS implementing pull strategy 
is modelled. A large number of simulation runs are conducted to probe the behavi(Hir of 
the system with respect to different parameter changes such as variability in processing 
times, demand shock, kanban size and number of kanbans. Implementation issues such as 
number of kanbans, kanban size, vehicle dispatching etc., are effectively addressed with 
the help of priority nets, thereby proving their utility. 
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Abstract. In today’s competitive environment for software products, quality 
is an important characteristic. The development of large-scale software products 
is a complex and expensive process. Testing plays a very important role in 
ensuring product quality. Improving the software development process leads to 
improved product quality. We propose a queueing model based on re-entrant 
lines to depict the process of software modules undergoing testing/debugging, 
inspections and code reviews, verification and validation, and quality assurance 
tests before being accepted for use. Using the re-entrant line model for software 
testing, bounds on test times are obtained by considering the state transitions for 
a general class of modules and solving a linear programming model. Scheduling 
of software modules for tests at each process step yields the constraints for the 
linear program. The methodology presented is applied to the development of a 
software system and bounds on test times are obtained. These bounds are used 
to allocate time for the testing phase of the project and to estimate the release 
times of software. 

Keywords. Software quality; software process modelling; re-entrant lines; 
software product testing. 


1. Introduction 

In today’s competitive environment for software products, quality has become an increas¬ 
ingly important concern to software development organizations. Quality denotes a multidi¬ 
mensional concept. As an intrinsic product attribute, the quality of software is recognized 
by the absence of defects. If we view quality from the point of product operation, at¬ 
tributes such as reliability, efficiency, usability and integrity are useful; whereas from the 
point of view of product transition/revision, parameters such as portability, reusability, 
inter-operability, and maintainability are important (Ghezzi et al 1988). 
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Several models relating to software quality have been proposed in the literature. These 
may be broadly classified into three categories, each for a separate purpose (Kan et al 
1994). 

( 1 ) Reliuhility models for reliability assessment and prediction. 

(2) Quality management models for managing quality during the development process. 
Quality management models are still in their development and maturing phase. These 
models emerged from the practical needs of large-scale development projects. The 
phase-based defect removal model and several tracking models belong to this category. 

(3) Complexity models and metrics which are used by software engineers for quality 
assurance purposes. Complexity models explain quality from the internal structure 
and complexity of the software. 

Software reliability modelling is.more mature than the other two types. A plethora 
of software reliability models have been developed over the years but, in spite of the 
extravagant claims for their efficacy, none can be trusted to give accurate results in all 
circumstances. An important reason for this is the validity of the assumptions underlying 
these models. 

(1) A detected fault is immediately corrected, 

(2) No new faults are introduced during the fault removal process. 

(3) Reliability is a function of the number of remaining faults. 

(4) Failure rate increases between failures. 

(5) Testing is representative of the operational usage. 

(6) Software is treated as a blackbox without looking at its structure and the process of its 
development. 

Recently, there has been much emphasis on improving the software development pro¬ 
cess, with the assumption that this will lead to improved product quality. However, a 
precursor to improved processes is an understanding of the dynamics of current processes. 
With respect to software processes, there are two prevailing schools of thought (Bollinger 
& McGowan 1991): 

• International Standards Organization (ISO) 9000 certification, and 

® Software Engineering Institute (SEI) assessment based on the capability maturity model 
(CMM). 

Process models for quality ensure the application of process engineering concepts, 
techniques, and practices to explicitly monitor, control, and improve the software process. 
However, these models do not yield quantitative measures of parameters such as reliability 
and usability to denote the quality of the product in the end. 

Software development lifecycle is a model of the software process. There are many steps 
and activities in building a software product. The process followed to build, deliver and 
evolve the software product from the inception of an idea all the way to delivery and final 
retireanent of the system is called the software production process and the order in 'which 
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these activities are performed defines the lifecycle for the product. Many models which 
attempt to capture this process, also called the software lifecycle models, have been de¬ 
veloped. Such models are based on the recognition that software, like any other industrial 
product, has a lifecycle which extends from its initial conception to its retirement and that 
its lifecycle must be anticipated and controlled in order to achieve the desired qualities of 
the product. Dalai et al (1993) distinguish between the upstream phases comprising re¬ 
quirements, specifications and design, and downstream phases comprising coding, testing 
and maintenance of the software development process. 

Conventionally, the software process is supposed to proceed sequentially from require¬ 
ments to specifications, design, code, testing, and then to release. One extreme description 
of the process of software development conjures up the image of a waterfall flowing from 
requirements successively onto release with no feedback from a succeeding phase to a 
preceding phase. The other extreme envisions a spiral where feedback constantly loops 
back from a succeeding phase to a preceding phase as repair of the process is needed. In 
practice, the actual process could lie anywhere in between and one needs to accurately 
model the flow of software modules. This is analogous to the flow of silicon wafers under¬ 
going processing (such as deposition, photolithography, etching etc.) in a semiconductor 
manufacturing plant. A study of software faults in the different phases of the lifecycle 
suggests that a majority of faults occur in the coding phase (Marick 1990) and that coding 
en'ors have substantially more severe effects than do design errors. Testing thus occupies 
a very cmcial role in the overall software development process. The purpose of software 
testing is to detect errors in a program and, in the absence of errors, gain confidence in 
the correctness of the program. Efforts to improve the effectiveness of testing can yield 
substantial gains in software quality. 

In this paper, we propose a queueing mode! based on re-entrant lines (figure 1) to de¬ 
pict the process of software modules undergoing testing/debugging, inspections and code 
reviews, verification and validation, and quality assurance tests before being accepted for 
use. This is the first model of its kind which depicts the process of testing software as seen in 
the software industry. The model takes into account the structure of the software, the indi¬ 
vidual modules being distinguished by their criticality in the mission and implementation, 
their usage in the operational field from profiles and test strategies used for testing these 
modules. We consider in our model, the notion of imperfect debugging and that new faults 
can be introduced in the process of imperfect debugging. The paper is organized as follows: 



Figure 1. A typical re-entrant line. 
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Software development lifecycle is described in § 2, re-entrant lines and some important 
results are discussed in §3, bounds on test times for software products using re-entrant 
lines are described in § 4, and a case study to illustrate the methodology is shown in § 5. 

2. Software development process modelling 

A large softwai'e project after requirement analysis and design is given to different program¬ 
ming teams for development. It is assumed that some software engineering methodology is 
used. The software is divided into modules based on the functions, its size and complexity. 
These modules after development need to be tested at various stages of the product build¬ 
ing. Testing is done by the developers during the coding stage (local/unit testing). The 
module, after unit testing, is given to an independent test team, not involved in its develop¬ 
ment, for further testing. This independent test team detects the faults and these modules 
are sent back to the developers with a log of the tests done and their outcomes. This is also 
reflected in the configuration control management (CCM) of the project. The developing 
team then debugs the code and corrects the errors. The same sequence is followed for 
all the modules of the software. This process continues till the required reliability for the 
module is achieved or the testing time allotted for it is reached. Different criteria to stop 
testing have been suggested in the literature (Dalai & Mallows 1989; Musa & Ackerman 
1989). 

Once these modules are tested, they are integrated and tested for interface errors and 
inconsistencies across modules. These, along with the libraries and related documenta¬ 
tion and standards, form the complete product. The validation of this product is done 
by an independent verification and validation (IVV) team. Code walkthroughs, inspec¬ 
tions and quality assurance tests are done at all stages from coding to acceptance of the 
softwai'e product. These tests defer modules to further testing if they do not conform to 
requirements/standards prescribed, which would otherwise certify the product for release. 

This whole process can be viewed as a multi-class queueing network as depicted in 
figure 2. The test teams denote the servers and the modules represent the customers who 
aiTive for service (testing). In figure 2, the first team denotes the unit-testing team where the 
developers locally test the modules during its development, the second server represents 
the independent test team, the third team denotes integration tests and IVV; and the fourth 
team, the QA and system testing. 

Consider the flow of a tagged module M through such a process. At the first test team, 
TTl, the module is unit tested by the developers. This module M is tested by an independent 
test team TT2 and the errors (if any detected and located) corrected by TTl. Unit tested 
modules arrive for integration and.later for verification and validation. Interface errors, 
non-conformance with requirements, or inconsistent representational formats with some 
modules causing integration tests to fail result in these modules being sent back to the 
corresponding teams for correction. Finally, when all the modules are integrated, the system 
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3. Model development 

Semiconductor wafer manufacturing plants are organized quite differently from traditional 
assembly lines or job shops. The production process of a silicon wafer consists of imprinting 
several layers of chemical patterns on the wafer; the final end product obtained is a multi¬ 
layered sandwich. Each layer in turn requires several steps of individual processing such 
as deposition, photolithography, etching, etc. with many of the steps repeated at several 
of the layers. The machines to perform these individual steps are very expensive. Hence, 
the machines are not replicated but revisited by the wafers for processing at different 
layers. The distinguishing characteristic of such a manufacturing system (modelled as 
a multi-class queueing networks), called a re-entrant line, is that the lots revisit several 
machines at several stages of their life. The main consequence of the re-entrant nature is 
that several wafers at different stages of their life have to compete with each other for the 
same machines. Figure 1 shows a re-entrant line with 3 service centres and 11 buffers. 
Parts enter the system at buffer b\ \ and visit the centres according to a deterministic route 
as shown. Finished parts emerge from centre 3 after undergoing processing following a 
wait in £> 33 . Note that each part in this example line visits centre 1 three times, centre 2 five 
times, and centre 3 thrice. Scheduling in re-entrant lines, input releases and scheduling 
policies have a significant effect on the performance of this system. Several policies have 
been studied by Kumar (1994), Lu et al (1991) and Khan (1995). 

Several researchers have recently come up with analytical methods to obtain upper and 
lower bounds on the performance of scheduling policies in multi-class Markovian networks 
(re-entrant lines) (Kumar & Kumar 1994). These methods rely on assuming stability and 
obtaining a set of linear constraints on the mean values of certain random vailables that 
determine the performance of the system. Augmenting these constraints with others ob¬ 
tained using conservation principles, bounds on performance can be obtained by solving 
the resulting linear program. Bounds on the mean delay (called cycle time) are obtained 
with different scheduling policies. The cycle time in software testing process corresponds 
to the time required to test all the software modules before release to the customer. 

In the proposed model for software product testing, servers (machines) denote the test 
teams and parts (silicon wafers) denote the software modules undergoing testing and 
correction. Due to the large number of modules at different stages of testing, test teams 
also need to schedule their tasks to select the next module to test. 

Consider a set of {1,2, 3,..., 5} of 5 test teams consisting of professionals and de¬ 
velopers of the code. Modules are classified based on their criticality and usage (from 
profiles). Modules of similar reliability requirements enter the system for testing at a test 
centre ^(1) € {1,2,3,..., 5} where they are labelled as of class type Q. Let Ci class of 
modules being tested at ^ (L) be the last set of tests done on these modules. The sequence 
{ 5 ( 1 ), 5(2),..., s{L)} is the route followed by the modules for tests. These modules visit 
the next team s (2) after being tested at 5 (1) and so on. We shall allow for the possibility that 
s(i) = s{j) for some classes i ^ j and accordingly call this type of system a re-entrant 
line (Kumar 1993). 

For this system we assume: 

(1) Modules arrive into the system for testing according to a Poisson process with rate A,; 
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(2) The mean time to test for every class Ci is 1 /pi and the times to test are distributed 
exponentially. 

The first team denotes unit/local testing which is done by the programmer himself/ 
herself. These modules take 1 /p i amount of time for the local testing. After this testing, 
the module is passed on to Team 2 which is a peer test team, not involved in the development 
of the module. Any bugs located by this team are recorded in the error log and sent back to 
the developers for coiTection and testing thereof. The time for testing this module would 
now be governed by the mean time to test for modules of class C 2 . In this fashion, when the 
modules are approved by the Teams 1 and 2, it passes on to Team 3 denoting Integration 
testing and System testing. Team 4 denotes Product testing and QA which checks for the 
process of software development and the product developed. This team either accepts the 
product in which case it is delivered to the user along with the proper documentation, 
or it sends back particular parts of the product which have non-conformance reports to 
the design team or for further testing. This feedback defines the re-entrant path for the 
module. This completes one cycle of the downstream process for the software. Due to 
non-conformance of some modules to the specifications/standards, the product release 
date is shifted till another cycle of the process is completed. However, in Cycle 2, the 
mean time to test for some classes is less than that of Cycle 1, due to the learning factor, 
experience gained and familiarity with the system to generate efficient test cases which 
maximize the coverage. This is analogous to the product-in-a-process approach suggested 
by Laprie (1993) to develop families of software. The path followed by modules demanding 
different levels of quality in this process is varied. Based on this model, we compute bounds 
on mean test time for modules. 

4. Bounds on test times: The LP approach 

Consider a strategy to select the next module for testing, which is - 

(1) Nonidling: If there is any module to be tested then the test team does not stay idle; 

(2) Stationary: The decisions to select the next module depends only on the number of 
modules of different classes in the system (Lu et al 1991; Kumar 1993, 1994). 

Let us rescale time so that A, -f XI/Li f^i = 1- We use uniformisation in which we 
sample a continuous time system to obtain a discrete time system with the same steady- 
state behaviour. We sample the system at all service completion times, as well as at the 
arrival times of new modules to the system for testing. Let {r^} be the sequence of such 
random sampling times and let Fr„ denote the cr-field generated by the events up to time 
T„. Let Xft) denote the number of modules of class Ci at time t. Also, let Wi(T„) = 1 
if the testing team at cr(z) is working on the module of class Ci at time t, and 0 other¬ 
wise. We take all processes to be right continuous, and thus Xi (r) is the state after the 
«th event, while due to the stationarity of the strategy chosen, Wi{Zn) = 1 implies that 
the team a(/) is busy working on Ci class of modules in the interval [t„, Zn+i). Let us 
denote 


X^(r„) = (Xi(r„), X 2 (t„), ..., Xi{r„)), 


(1) 
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In the steady state, 

£[X^(r„+i).e.X(r„+,)] = £[X^(r„).e.X(t„)], 

for every symmetric matrix Q. We presume that the steady-state distribution has a fir 
second moment on the total number of modules at each buffer. For this equation to he 
we need 

E[Xi{zn+\).Xj{Tn+i)] = E[Xi{Tn)-Xj{rn)] for \<U j <L. 

Now consider the implication of the equality 
£[xf{r„+i)] = £:[xf(r„)]. 

From the state transitions for the buffer i shown in figure 3, we have 

X/(t„+i) = Xiizn) -h 1: exogeneous arrival to Ci at r„+i, 

= XiiZn) + 1: previous class tests completed, 

= XiiZn) - 1: current class tests completed, 

= Xi (Zn) : otherwise. 

Suppose every class Ci has an exogenous arrival process, which is Poisson with rate 
Also suppose that with probability qij, a module passes from class Ci to Cj. 

From the equality equation, E[X^{Zn+i)] — E[X^(Zn)], and using the statiom 
policy of the strategy used, we get the following equality constraints: 
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where zfj = £;[W/(r„).Xy(r„)] 

Zki I + f^k^kiZkj + ttkQkjZki 
\kelij) ) \k€l(i) ) k^i k^i 

IJ^jqjiiZjj - Zji - Pj) + tliqijiZii - Zij - Pi) 

- /x/(l - qij)zij - tijO- - qji)Zji = 0. 

Now using the nonidling policy, we get the following inequality constraints; 

Y1 fori = L; cr = I,S with a ^ o{i), 

{7X;)=<7} i^td) 

(5) 


and the nonnegativity constraints 

Z /7 > 0 fori,j = 1,L. (6) 

If the scheduling strategy is stationary and nonidling with a steady-state distribution pos¬ 
sessing a finite second moment, then the mean number of modules in the system at various 
stages of testing is bounded above by 

i j€a(i) 

and below by 

i j&aii) 

Equations (7) and (8) denote the bounds on the number of modules in the system. Using 
Little’s law, L = XW (Little 1961) and assuming that the arrival rate of modules to test is 
constant, we obtain the bounds on testing time for the modules. 


5. Examples 

Example 7. In this section, we consider the development of a re-entrant line based software 
process model for a firm executing a software project of moderate size (needing a few 
person months of effort). It is identified at the preliminary design level that the software 
is made up of 40 modules of similar complexity. The underlying re-entrant line model 
is shown in figure 4. It is assumed that there are two programming and testing teams. 
Software modules are first unit tested by the developers (Team 1) and Team 2 acts as an 
independent test team for these modules. 

For simplicity, we assume that new modules arrive for testing by Team 1 with rate X and 
the route followed by all modules in the re-entrant line model is deterministic. A class C/ 
of modules takes (l//x;) person hours to test a module. The linear program to bound the 
mean number of modules in the re-entrant line (figure 4) is: 

min[zii - 1 - 231 -f 222 + ZAi + zn + Z33 + Z24 + ^44] 
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Figure 4. A re-entrant line model for a software testing process. 


and 


max[ 4 l 1 + Z31 + Z 22 + 242 + ^13 + ^33 + ^24 + ^44]- 

The equality, inequality and non-negativity constraints are given by (4), (5) and ( 6 ) respi 
tively. Defining pi = X/pii ~ 0.25 and solving the linear program, we obtain bounds 
test times as [4.0-4.5] person months. With these estimates of test times, we can alloc 
approximately 18 person weeks for the testing phase of this project. 

Example 2. If the 40 modules in the software system of example 1 are classified bas 
on their criticality and usage, with more test time allocated to the high usage and criti 
modules, we can obtain realistic values for bounds on test times. We classify the modu 
based on the criticality of their function in the mission and the usage (from profiles) in 
use environment. The Criticality-Usage matrix is formed for the modules of the syste 
This was used to decide the class to which the module enters and hence the testing tir 
These data are summarized in table 1. 

In the above matrix, modules of {CU{\, 1)} are made members of class C\, m( 
ules of {CU(l, 2), CU(2, 1), CU (2, 2)) are made members of class C 2 and modules 
{Ct/(1,3), CU(2,3), C(3,l), C(7(3, 2), Ct/(3, 3)} are made members of C 3 , as tin 
are critical and frequently used modules. The testing times are varied accordingly in 
ratio of 1 :2:4 for the modules of classes Ci: C 2 : C 3 . The linear program is solved to obt 
the bounds on the test times for the modules of different criticality and usage. The resi 
obtained are summarized in table 2 . 


Table 1. The criticality-usage matrix CU(i, j). 


Criticality 

Usage 

Low 

Medium 

High 

Low 

6 

10 

n 

Medium 

4 

6 

2 

High 

4 

3 

3 
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Table 2. Bounds on test times for example 2 .(§ 5). 


Class 

Mean test time in person weeks (bounds) 

Cl 

[3,4] 

Cl 

[8, 11] 

Cl 

[19, 23] 


Total test time: [30, 38] person weeks or [7.5-9.5] person 
months (Assume 4 person weeks in a person month) 


^ith these estimates of test times, we can allocate approximately 9.5 person months of 
ng. If the milestone for completion of coding is set at the end of 14th month, then we 
allocate the testing and verification phase to end by the 24th month from the start of 
)roject. 


Conclusions and discussion 

ocess model which depicts the downstream phases of the software life cycle modelled 
re-entrant line is presented. Further, based on this model, a method to compute bounds 
5st times of software is presented. Due to priority test scheduling of modules, the re¬ 
ant model is not of product form and hence not amenable to closed form solutions for 
dy-state analysis. Bounds on test times are obtained by considering the state transitions 
i general class of modules which leads to a linear programming model. Scheduling of 
vare modules for test at each process step yields the constraints for the linear program, 
n the bounds on the test times, the product release times are obtained. We illustrate 
methodology using an application for which bounds on test times are obtained. For 
iules of varying criticality-usage factor, we observe that the test times are not scaleable. 
1 software development applications, a module’s route through test teams is not the 
e for all modules and is not deterministic. The current model can be extended to reflect 
situation with the introduction of path profiles and a route matrix for the modules 
ay Rao 1995). This model can also be used to decide on the release times of software 
i a specified reliability measure (Vijay Rao 1995). 


authors wish to thank the anonymous referees for their useful comments which helped 
nproving the examples of § 5, and Dr N K Srinivasan and Prof Y Narahari for useful 
ussions. 
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Competitive manufacturing systems ~ Part 2 


Foreword 

This is the second of the two special issues in the important area of competitive manu¬ 
facturing systems. The present issue contains eight papers covering several important 
topics. The first two papers view a manufacturing enterprise as a collection of value¬ 
delivering business processes and propose quantitative methods for designing world-class 
enterprises. The next three papers focus on new technologies for computer-aided design 
and engineering of high quality products. The following two papers are concerned with 
efficient algorithms for machine and job shop scheduling. The eighth paper in this special 
issue looks at recent advances in material handling systems, which constitute an essential 
subsystem in manufacturing plants. 

In the first paper of this issue, Viswanadham and Raghavan emphasize the important 
notion of flexibility. There has been a fair amount of research on flexibility of manu¬ 
facturing systems in recent times, but the research has looked at only subsystems in an 
enterprise. The work of Viswanadham and Raghavan takes a process-based look at flexi¬ 
bility and proposes qualitative and quantitative models to assess and manage flexibility of 
business processes. Three critical business processes are discussed: New product develop¬ 
ment process, order-to-delivery process, and supply chain process. After defining generic 
performance measures, including flexibility, for business processes, the authors identify 
the various types of flexibilities by emphasizing these three business processes. The au¬ 
thors also advocate strategies for managing flexibility based on the insights revealed by 
the quantitative models. 

In the second paper, authors Bhaskaran and Leung concentrate on the supply chain 
process and present quantitative models to facilitate reengineering of supply chains. The 
models that the authors employ are derived from the classical operations research/industrial 
engineering disciplines. Through several examples and a case study, the authors demon¬ 
strate the use of these models in identifying reengineering opportunities, evaluating design 
alternatives, guiding the selection of the best alternative, and deploying tools to implement 
the design. 

The third paper in the special issue is by Hock. In this paper, the author argues and 
demonstrates that design, engineering, and manufacturing are all closely related. By con¬ 
centrating on two well-known design methods, namely quality function deployment and 
Pugh’s concept and using a case study of designing and developing a hematology machine, 
the author makes a case for integrated product development. 

In the next article of this issue, Chandru and Manohar provide an overview of some 
major and some recently emerged paradigms for geometric modelling: Constmctive solid 
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geometry, boundary representation, non-manifold models, and voxel models. The authors 
then elaborate on the unique virtues of voxel models in the context of emerging manufac¬ 
turing technologies and argue strongly in favour of voxel models. The authors also look at 
important research directions in voxel modelling. 

Narayan, Rao, and Gurumoorthy, in their paper on feature-based geometric reasoning 
for process planning, present a framework based on domain-independent fonn features 
for automatic assessment of manufacturability and process planning for machining. Their 
approach offers a novel improvement over existing approaches by combining automatic ex¬ 
traction of domain-independent features with knowledge-based reasoning. This facilitates 
an effective framework for automated process planning. 

The next two papers are on the topic of scheduling. The first paper, by Wang, Luh, Zhao, 
and Wang, originates a Lagrangian relaxation based scheduling methodology for job shop 
scheduling by considering on-time delivery and low work-in-process as the objectives. The 
methodology is based on iterative solving of decomposed subproblems and the authors 
present an algorithm that combines backward dynamic programming with the conjugate 
gradient method to solve the scheduling problem in a Lagrangian framework. The method 
has very good performance in terms of rapid convergence and automatic evaluation of the 
quality of schedules obtained. Their methodology is a rich addition to the collection of 
techniques available for this difficult problem. 

In their paper on the on-line maintenance of optimal machine schedules, authors Aman, 
Balakrishnan, and Chandru consider the on-line scheduling problem of dynamically 
scheduled jobs with precedence constraints on a single machine to minimize the maximum 
completion time. They develop an efficient technique to reoptimize a schedule when new 
jobs arrive. The paper illustrates many algorithmic and analytical issues associated with 
developing effective methods to update schedules on-line. 

Material handling is a critical component in modem manufacturing plants because it 
provides material integration in the factory and can help cut lead times and inventory. 
There have been significant advances in the technology and operation of discrete material 
handling systems in recent times. Heragu and Rajagopalan provide a very useful overview 
of the advances in this area, in the eighth paper of this special issue. 

It has been a pleasure for us to work with all the individuals concerned with various 
aspects of this two-volume special issue. First, we would like to acknowledge the contin¬ 
uous support and ideas from Professor Viswanadham. Authors of papers have been most 
co-operative and have taken minimal time. Reviewers have been very prompt and thorough 
in their reviews. It is mostly because of this wonderful quick response from the authors 
and reviewers that these special issues have seen the light in exactly one year’s time (from 
conception to market). The Academy staff, coordinated efficiently by Ms Shashikala, have 
also contributed significantly to the compression of this issue’s lead time. We wish to place 
on record the excellent cooperation from our respective departments. 


April 1997 YNARAHARI 

M KAMATH 
RAVI ANUPINDI 
Guest Editors 
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Abstract. A manufacturing enterprise is a collection of interrelated, flexible, 
optimized business processes delivering value to the customers through high 
quality products and services, faster than competition. This view of an enter¬ 
prise enables one to consider the entire business system including the suppliers, 
product development, manufacturing, logistics, distribution, and retailing and 
to smoothen out the interfaces between them. Performance measures and per¬ 
formance measurement are important for monitoring, control and management. 
We identify and discuss eight performance measures for generic business pro¬ 
cesses. These include lead time, customer service, dependability, quality, flex¬ 
ibility, cost, capacity, and asset utilization. 

In this paper, we concentrate on flexibility of business processes with special 
emphasis on the supply chain and order-to-delivery processes. We attempt to 
provide clear definitions and measures of various types of flexibilities as well 
as discuss the relationship between product structure and supply chain flexibil¬ 
ity. The relationship between uncertainties, flexibility, technology, and product 
structure is clearly brought out in this paper. 

Keywords. Manufacturing enterprise; performance measures; flexibility. 


1. Introduction 

It is commonly believed that there will be dramatic changes in the ways successful manu¬ 
facturing companies are organized in the 21st century. These changes are basically driven 
by technological innovations, changes in political landscape, and demographics. In the 
developed world, electronic commerce has proliferated, and the information superhigh¬ 
way has made access to video connection and text-based information easy and almost 
free. Organizational structures, accounting practices, trading methods, compensation and 
incentive schemes that were so successful in the decades after World War II have be¬ 
come obsolete. In an earlier report (Viswanadham 1996), we have delineated issues that 
make manufacturing play a proactive role in inventing new businesses and organizational 
stmctures that give sustainable competitive advantage. 
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Manufacturing has gone through successive periods of great changes. New materi¬ 
als, such as plastics, ceramics and composites, new technologies such as computer-aided 
design, manufacture and inspection, and the internet, new techniques such as kanban and 
just-in-time, new bases for competition such as cost, quality, time or core competence have 
all been at the root of these changes. Currently, global competition, demanding customers, 
liberalization which provides a congenial environment for direct foreign investment, reg¬ 
ulations on environment, emergence of common markets, disintegration of large states, 
and volatile exchange markets have made manufacturing a more complex function. Cus¬ 
tomers want it all; low cost, low defect rates, high performance, on-the-spot delivery and 
maintenance without irritants. To meet such demanding customer needs, computer-aided 
automation, effective flexibility management, strategic alliances, management of end-to- 
end processes such as supply-chain process, new product development process etc., are 
important. 

This paper is organized as follows: In § 2, we survey the history of manufacturing 
emphasizing the paradigm shift from mass production to mass customization. In § 3, we 
describe the cosmic view of a manufacturing system including suppliers, distributors, 
employees, technology, competition and customers. In § 4, we define a manufacturing 
system as a collection of business processes and consider issues of analysis using this 
approach. 

In § 5, we describe the supply-chain process and the order-to-delivery process which 
are two important customer value delivery processes in a manufacturing company. In § 6, 
we identify eight performance measures for a business process. Section 7 is on flexibility 
of business processes. Since flexibility is the effective management of change, it is es¬ 
sential to conduct root cause analysis of changes and find ways of neutralizing or coping 
with the changes. We define business process flexibility in general and discuss the influ¬ 
ence of technology on flexibility. We bring out the importance of information-sharing and 
communication in enhancing the flexibility of business processes. In § 8, we present the 
definitions of various flexibilities of the supply-chain and the order-to-delivery processes. 
Product structure is the subject matter of § 9 illustrating the influence of product modularity 
on staged manufacturing and the supply chain. We conclude the paper in § 10. 


2. History of manufacturing 

Manufacturing, the world over, has undergone tremendous changes. Several companies 
in the West have gone through the phases of high growth, decline, restructuring, steady 
growth and so on. Several revolutions have occurred in the manufacturing arena: just- 
in-time, total quality management, time compression etc. In this section, we survey the 
history of manufacturing with emphasis on changes in factors driving the competition and 
changes in organization stmcture. 

2.1 Mass production system 

In the early twentieth century, Henry Ford revolutionized manufacturing with the introduc¬ 
tion of the transfer line for mass production. Complete and consistent interchangeability 


of parts and their assembly on a moving, continuous assembly line gave Ford tremen¬ 
dous advantages over competitors. Specialization and division of labour were the key 
concepts based on which both blue and white collar jobs were organized. Unskilled labour 
were trained quickly and supervisors, industrial engineers and quality inspectors ensured 
consistency and accuracy. The white collar jobs (so-called back-office work), were also 
broken down into small, repeatable tasks which were mechanized and automated. F W 
Taylor invented the principle of scientific management which made work more special¬ 
ized, precisely defined, interchangeable and optimized. Alfred Sloan as the President of 
General Motors was responsible for developing the easily scalable, pyramidal organiza¬ 
tional stmcture by creating decentralized divisions managed by specialists and coordinated 
by corporate headquarters. The mass production system was thus bom and the fathers were 
Ford, Sloan, and Taylor (Womack et al 1990). 

The mass production system is characterized by assumptions of constancy and pre¬ 
dictability of demand, and the logic of economies of scale and division of labour. The 
characteristics of such factories include: dedicated machines, long production mns, naiTOW 
product range, low skilled workers, command and control management, vertical internal 
communication, high volume, sequential product development, high inventories, make to 
stock, limited communication with the customer, large number of suppliers and dealers, 
mass marketing etc. This mass production paradigm has influenced generations of indus¬ 
trialists and has generated wealth for western nations for over four decades. Also, we can 
see that stable, homogeneous markets and standardized products with long life, dedicated 
mass production facilities, command and control organization structure, low skilled labour, 
and long production runs, are all mutually reinforcing (Milgrom & Roberts 1990). 

2.2 Modern manufacturing system 

The face of manufacturing has changed. Dedicated equipment is replaced by flexible 
machine tools and programmable multitasking production equipment (Viswanadham & 
Narahari 1992). Their use reduced changeover times and small batch size production 
became economical. Small batch sizes shortened production cycles and reduced work-in- 
process and finished goods inventories. Advances in computers and communications made 
possible direct contact with customers, suppliers and dealers. Rapid transport and commu¬ 
nications resulted in global competition. This has created customers who make relentless 
demands in quality, service and price contrasting with the gentle, grateful, loyal customers 
of Ford and Sloan. Toyota, under the leadership of Ohno and Toyoda perfected the lean 
manufacturing system with emphasis on just-in-time deliveries, quality, and planning pro¬ 
duction and product development jointly with suppliers and dealers. Manufacturers now 
use the point of sale information to determine the production schedules. There is a general 
strategic emphasis on speeding up all aspects of a firm’s operations: Shorter development 
cycles, quicker order processing, freedom from defects and speedier delivery. Make-to- 
order and almost instantaneous delivery have become more the rule than the exception. 
Frequent product improvements and new product introductions, combined with the need 
for speed have resulted in the use of cross functional teams for design and manufacturing. 

We see thus that Adam Smith’s world has changed considerably. Companies created 
to thrive on the mass production paradigm cannot succeed in the world of fast changing 
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customer demands, short product life cycles, changing technologies, fierce competition 
and fluctuating exchange rates. 

2.3 Competitions based on cost, quality, time and customer focus 

Lean production or Toyota production system has introduced the concept of waste elimi¬ 
nation by using just-in-time production. Waste, defined as any non-value adding activity, 
such as storage, inventory, transport, inspection, setups, machine down time, repair etc. 
should either be eliminated or minimized. Toyota has relentlessly attacked all forms of 
waste; reduced setup times, given importance to 'doing it right first time\ eliminated in¬ 
ventories, introduced kanbans to digger production, and used load leveling methods to 
smoothen the work flow. Toyota’s methods of cost reduction during the energy crisis years 
of the 70’s made them leaders of world class manufacturing. Also the new concept of 
target costing i.e. price is not cost plus profit but cost is market price minus profit, has been 
introduced (Womack ef 1990). 

The eighties have seen the total quality management (TQM) revolution spreading from 
Japan to the West. TQM strives for a totally integrated effort towards continuous improve¬ 
ment at every level of the company: Design, production, marketing and sales. The ultimate 
beneficiai 7 of TQM activity is the customer who receives high quality products/service at 
a reasonable cost. TQM involves process control rather than product testing. The aim is to 
produce zero-defect products through statistical process control and coordinated testing at 
input, production and final delivery points. Company-wide education and training, man¬ 
agement commitment to design and implementation of an effective quality, performance 
measurement and reward system are essential for the success of TQM programs (Smith 
1993). 

Flexible automation involving numerically controlled machines connected by an auto¬ 
mated material handling system and local area networks was the next important develop¬ 
ment. Computer-aided design and manufacture has made possible production of a variety 
of products at mass production economies. Consistent quality and manufacturing flexibility 
are two distinguishing features of factoiy automation. Together with factory automation, 
production planning and control systems such as MRP-II also emerged. The MRP-II system 
includes bill of materials, production plans and schedules, shop-floor control, inventory 
analysis, forecasting, purchasing, order processing, cost accounting, capacity planning etc. 
at various levels of detail (Viswanadham & Narahari 1992). 

Time-based competition is the issue occupying the minds of manufacturing executives 
even today. Customers want it all: price, quality and faster delivery, and hence firms must 
shrink the time from conception to consumption. Time reduction provides an important 
leverage not available in cost-reductioii strategies. By removing time from their operations, 
costs are reduced. In manufacturing industries, time compression requires that attention 
be given to all activities including order processing, scheduling, distribution and customer 
service, i.e. time compression should include all activities not just on a single production 



A new way of organizing businesses based on core competences is emerging in contrast 
to the product-centric view currently held by most companies (Prahlad & Hamel 1990). 
Here, businesses nurture a bundle of competences by developing skills, capabilities, which 
in turn allow the company to market a group of world class products. Core competences 
developed over time by integrating the skills and resources in an organization will give the 
company a sustainable competitive advantage. Honda’s knowhow in engines; logistics and 
inventory management of Federal Express; Sharp’s competence in flat-screen displays etc., 
have allowed these companies to manufacture a variety of products/services which appear 
diverse. Many companies are turning to the competence-centric view and are disinvesting 
in the non-core sectors of their businesses. 

We thus see that the logic of modem manufacturing is rooted in time and competence- 
based competition, flexibility and scope economies. Over the last decade a paradigm of 
mass customization has emerged with features such as flexible machines, short production 
mns, frequent new product introductions, niche markets, highly skilled team workers, cross 
functional teams, low inventories, no defects, make to order, continuous learning, extensive 
communication with customers, empowerment and long-term trust-based relationship with 
suppliers. We thus see that, in order to survive in the new world, closely coordinated changes 
are to be made in a whole range of the firm’s activities. Quick fixes such as buying a CNC 
machine centre, setting up e-mail facilities or buying CAD software tools or installing 
MRP-II will not help. 

3. The manufacturing system 

In this section, we present an integrated view of a manufacturing enterprise. Traditionally, 
manufacturing meant just the factory floor and during the 70’s and 80’s much effort spent 
around the world has aimed at improving technologies (FMS, CIM), practices (JIT, TQM), 
and effectiveness. 

In this paper, a manufacturing system means the entire enterprise and consists of several 
interconnected subsystems, all of which have to act in cohesion so that customer-desired 
products are delivered on time. The subsystems include (see figure 1). 

(i) Suppliers: They provide the subassemblies, components or raw materials just-in-time 
to the factory floor and play a critical role. Their flexibility, agility, defect control and 
organization structure should all be compatible with the goals, objectives and vision 
of the manufacturing system. For example, suppliers of companies introducing new 
products frequently, should also have similar capabilities. Having only a few loyal 
suppliers and effectively communicating to them the product and process designs 
are some of the new successful practices that Japanese firms have taught the rest of 
the world. 

(ii) Product development: Being fast enough to be the first in the market with the right 
product, is worth more to the prosperity of most businesses than any other single 
manufacturing function. Aggressive global competition, emergence of new tech¬ 
nologies, opportunities for high value quality of life products, and legislative and 
environmental requirements, are some of the drivers for new product development. 
Concurrent engineering, integrated product-process design, multi-functional teams. 



140 


N Viswanadham and N R Srinivasa Raghavan 


and incorporation of customer voice, are some of the enabling solutions for effective 
product development 

(iii) Factofy floor: A flexible low-inertia factory, responding quickly to design and de¬ 
mand changes of the existing products and also to the need to produce new products, 
is mandatoi 7 . The layout, machines, work force and instrumentation and control 
hardware and software, are important elements of the factory floor. Recent trends 
include flexible manufacturing systems under computer control using client server 
architecture and agent-based scheduling strategies. Production planning and control 
strategies such as MRP-II also reside in the factory host computer. 

(iv) Integrated logistics: Rapid response logistics to store and move raw materials, com¬ 
ponents, work-in-progress (WIP) and finished goods throughout the manufacturing 
system is another prime component of the manufacturing system. To maintain just- 
in-time deliveries, on time, to globally distributed customers and to make-to-order 
final products from subassemblies produced in several geographic locations, an agile 
logistic system that is mix- and volume-flexible, is essential. 

(v) Distributors: The ultimate customer is served through the distributors and they play a 
crucial role in selling the product to the customer. They can also provide feedback on 
customer voice and their expectations. Direct mai'keting, telesales, contracts between 
distributors and manufacturers, outsourcing and strategic alliances are some of the 
issues here, 

(vi) Customers: They are the ultimate users of products and services and receive the final 
output and generate revenue. Incorporating customer voice into the designs, delight¬ 
ing the customers with no irritants in the purchase and use of the products, gearing 
the organization towards delivering value to the customers, analyzing customers 
complaints and defections and fixing the process, are mandatory exercises. 

(vii) Organization structure: Design of a highly responsive organization structure through 
which information and decisions flow through very quickly, manned by people who 
make decisions and effectively communicate among themselves and with other stake 
holders, is critical for success. The tendency nowadays is to replace the traditional 
bureaucratic organizational structure with team-based flat structures. Management 
of end-to-end customer business processes such as order-to-delivery, new product 
and process development, using multi-functional teams headed by process owner 
and aided by software agents is also a popular paradigm. 

(viii) Competition: Staying close to competition, benchmarking their best practices, win¬ 
ning their core-customers are strategies for survival. Competitive intelligence is 
cuiTently an art and involves inferring from publicly available data, the methods and 
practices that make a company number one, 

(ix) Technology: This is the factor that pervades all the above subsystems. Successful 
manufacturing companies use advances in technology to introduce new products, 
to run the factories more efficiently and also to increase their customer reach. Elec¬ 
tronics technology has had tremendous impact on the control, instrumentation and 
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Figure 1. Competitive manufacturing as coordination of diverse skills and integration of 
multiple activities. 

the way businesses are conducted (Venkataraman 1994). Internet, EDI, electronic 
funds transfer and e-mail have made several traditional systems obsolete. CAD sys¬ 
tems have speeded up the design and retrieval process, and CIM has enabled flexible 
manufacture of high quality products. Managements should appreciate the value of 
technology and integrate it into their company’s activities through effective use of 
R & D (Upton & McAfee 1996). 


4. The business process 

Traditionally manufacturing systems are viewed as a sequential arrangement of functions 
such as design, manufacture, R&D, marketing, finance, and so on. The recent trend is 
to view a manufacturing system not in terms of functions, divisions or products, but as 
a collection of value-delivering processes. Functional or hierarchical structures typically 
present responsibilities and reporting relationships whereas process structure is a dynamic 
view of how the organization delivers value to the customer. 

Hierarchical organization structures based on functional divisions have several prob¬ 
lems. Each function acts as a silo and hands over its output over the wall to the next 
function. Turfwars, dominance by functions such as finance and marketing, result in slow 
progress of work through the system. Lack of proper communication between functions 
results in work going back and forth with long iteration periods (see figure 2). The hierar¬ 
chical arrangements in the functions require that decisions are to be sought from the top 
and work processes inch up and down the hierarchy ladder. Thus one finds that the ratio 
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Figure 2. Work flow through a functional organization. 


cross-functional teams in a coordinated manner (Davenport 1993). Process thinking orig¬ 
inated with the quality movement wherein emphasis is on process control and team work 
rather than product inspection and command and control hierarchy. A process approach 
to business implies heavy emphasis on how work is done rather than focus on specific 
products (see figure 3). 

DEFINITION 1 

A process is a structured, measured set of activities ordered in time and space, designed 
to produce a customer-desired output. 

Clearly, structured processes are amenable to measurement in a variety of dimensions: 
cycle time, defects, variability, flexibility etc. Examples of some typical processes include: 
order-delivery process, new product development process, supply chain process, factory 
floor process, maintenance process etc. A process perspective is a horizontal view of the 
manufacturing system that cuts across the organization with product inputs at the beginning 


Function 1 


Function 2 


Function 3 








Figure 4. Business process hierarchy. 


and customers at the end. Subscription to this view means deemphasizing the functional 
view of the business. The primary issue in vertical organization is the ill-management of 
hand-off between functions. Process orientation either eliminates hand-offs or coordinates 
them effectively. Processes are typically cross-functional. 

There are several ways of decomposing a business process. We find it useful to decom¬ 
pose it into various subprocesses and work processes as shown in figure 4. There are four 
different types of sub-processes in a business process. They are the following. 

(1) Work process: This is a value adding activity with well-defined inputs and outputs 
with someone responsible for it. Examples include machining by a machine centre, 
transportation from one location to another by a truck, bill processing by a clerk etc. 

(2) Interface between work processes: Interfaces are white spaces between work processes 
and interface management involves documented procedures for transfer of work from 
one work process to another. Some of the procedures may involve automated transfer 
of the workpiece from one machine to another. Generally, work processes may be 
within the same function and hence management may not be difficult. 

(3) Functional mterfaces: These are the procedures to be followed when the output of a 
work process in a given function in an organization is transferred to another in a dif¬ 
ferent function in the same organization. Interfaces between design, manufacturing, 
marketing etc., come under this category. These are smoothened out by using cross¬ 
functional teams to manage the subprocess in the organization and also by putting a 
measurement system in place that gives more weightage to the contributions made to 
the process by individuals and functions. One of the biggest achievements of process 
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orientation is recognizing the problems associated with functional interfaces and evolv¬ 
ing procedures for managing them. 

(4) Organizational interfaces: These are again relationships, procedures and activities 
performed when a process transits from one organization to another. Examples in¬ 
clude supplier-manufacturer, manufacturer-distributor-retailer, distributor-customer 
relationships. Here again through certification and alliances one can smoothen the in¬ 
terfaces and avoid all non-value adding and time-consuming activities such as selecting 
the suppliers through quotation, redundant activities such as incoming inspection by 
customers and outgoing inspection by suppliers, financial guarantees, multi-sourcing 
for reliability reasons etc. Through partnerships one can avoid overhead costs as well 
as smoothen the workflow. 

Generally, functional and organizational interfaces are unmanaged or managed infre¬ 
quently by higher levels of management. The management procedures, when they exist, 
are outdated. The prescription is to treat the interfaces also as work processes, with an 
ownership, well-defined inputs and outputs, and monitor their performance through ap¬ 
propriate measures. The goal is to smoothen the workflow and make it as continuous as 
possible. 


5. Supply-chain and order-to-delivery processes 

Supply chain process and the order-to-delivery processes are the most important business 
processes of a company and directly involve the customer. ODP starts with the arrival of an 
order from the customer and ends with the use of the product by the customer. In fact, some 
companies consider after sales support, and the return and recycling of the used product 
also as a part of the order-to-delivery process. 

5.1 Supply-chain process 
DEFINITION 2 

Supply-chain process (SCP) encompasses the full range of intra-company and inter¬ 
company activities beginning with raw material procurement by independent suppliers, 
through manufacturing and distribution, and concluding with successful delivery of the 
product to the retailer or at times to the customer (Cohen & Lee 1988). 

Supply-chain management is the coordination of the supply-chain process, i.e. inte¬ 
gration of the activities/subprocesses involved in procuring, producing, delivering and 
maintaining products/services to the customer who are located in geographically different 
places (see figure 5). 

Long term issues in SCP involve location of production, and inventory facilities, choice 
of alliance partners such as the suppliers, distributors, and the logistics chain. The long temi 
decisions also include make-to-order or make-to-stock policies, degree of vertical integra¬ 
tion, capacity decisions of various plants, amount of flexibility in each of the subsystems 
etc. (Connors et al 19951. 




n Tier Supplier 


I Tier Supplier Manufacturer Distributors Retailers 

Figure 5. The supply-chain network. 


Customer 


The operational issues in SCP are the ones we are more concerned with here. They in¬ 
clude cycle time, on-time delivery, cost effectiveness, flexibility and quality. Identification 
of customers and triggers for their loyalties (cost, after sales service, on-time delivery) are 
also to be assessed. 

5.2 Order-to-delivery process 
DEFINITION 3 

The order to delivery process consists of six steps starting from accepting orders through 
order configuration, sourcing the order, managing the order, monitoring and finally billing 
and cashing. 

We briefly describe each of these functions below (see figure 6). 

(1) Accept orders: Customers choose many ordering methods and most companies accept 
orders through traditional means like fax, mail or through sales representatives or 
through EDI, telemarketing or direct marketing channels. Orders can also be placed 
on the internet, provided the necessary infrastructure is available. Customers can then 
track the order on the internet. The order information should be visible to all members 
of the supply chain. One important issue in this step is order selection and prioritization. 
All orders need not be accepted and not all customers are equal. It is known that 80% of 
the orders come from 20% of the customers for 20% of the products. These customers 
should be treated as “sweet spots” and given priority. 

In this step, we have four tasks, 

(a) Order receipt. 

(b) Order selection. 

(c) Credit check: Previous customers are checked for default; new customers are 
checked for credit rating. A rule-based expert system can suggest credit limits. 
Any exceptions that may arise are handled manually. 
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(d) Order confirmation; This involves the following. 

- Check warehouse inventory and the allocation to current orders and back orders 
at all warehouse locations and determine if the current order can be met from 
nearby warehouses. 

- Check the production orders in various plants. 

- Make a decision on how the items from the order are to be shipped to the customer. 

- Estimate cost and time. 

- Dialogue with the customer on cost, delivery date and site planning if it involves 
equipment installations etc. 

- Confirm the order, price and delivery date and time to the customer. 

Figure 7 describes the controller interconnections that would enable the above functions 
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Figure 7. Delivery times for the three situations. 


(2) Configure order: This consists of the following steps. 

(a) Identify the complete list of products and services that are contained in the order. 

(b) Plan and source each product and service in the order. 

(c) Synchronize delivery of each of the items with availability of service personnel 
and transport as well as the convenience of the customer. 

(3) Source the order: If the order can be met from a nearby warehouse, then all activities 
relating to picking and packing are initiated. If the delivery time allows, orders may be 
sourced from a nearby manufacturing assembly plant by passing the local distribution. 
In this case, the order is included in the production schedule. If the plant does not have 
preassembled subassemblies, then they are sourced from assembly suppliers and is 
assembled and sent directly to the customer. The idea is to supply the order within the 
delivery window (see figure 7). 

(4) Order management: This is the final step and involves the steps outlined below. 

(a) Making plans for coordinated delivery of the product to the customer. 

(b) Customer’s order execution plan is communicated to each member in the supply 
chain responsible for shipping the product or providing a service such as trans¬ 
portation or installation. Planning for transportation can start before shipment is 
ready. 

(5) Order monitoring: Like in project management, the progress of the order is monitored 
and actual progress is compared with the anticipated progress. The signals of com¬ 
pletion or failure are sent to appropriate agencies which automatically monitor the 
progress. 

(6) Billing and cashing: Invoices are certified after installation of equipment and payment 
authorized to customer’s accounts payable, who transfers funds via EFT to company’s 
accounts’ receivable. 

The above ODP can be automated using EDI, bar coding, decision support systems and 
other IT tools to achieve rapid cycle times and high customer satisfaction (figure 8). 



Figure 8. ODP controller. 


6. Performance analysis: A process approach 

Traditional performance analysis of manufacturing enterprises assumes that the enter¬ 
prise is weakly coupled, i.e., the subsystems work almost independently. Paper-based 
communication, high inventories and arm-length relationships with suppliers, distribu¬ 
tors and other stake holders justified this assumption. Performance analysis and opti¬ 
mization are conducted for individual subsystems and near-optimal performance of the 
total system was obtained by summing up the results. Each company is happy with its 
own operational and financial measures. Each function concentrates on and optimizes 
its own measures: sales on orders, manufacturing on throughput, distribution on ship¬ 
ments that maximize the truck utilization etc. This could result in sub-optimization at 
the overall systems and delayed customer deliveries. Products are designed for ease of 
manufacturing (DEM) and in assembly (DFA) and so on, but without worrying about 
logistic costs, import duty tariffs etc. In the name of reducing inventory costs, compa¬ 
nies push the inventory either upstream to suppliers or downstream to distributors. Little 
do they realize that although they may have made cost reduction in the short term, the 
entire supply chain cost i.e., cost to the customer is the same or higher. Several interme¬ 
diate manufacturers (e.g. automobile part suppliers) have to maintain lots of inventory 
because their customers (Original Equipment Manufacturers) have erratic ordering pat¬ 
tern, bad scheduling etc.; their suppliers are unreliable and are not quality conscious. 
Although they are lean and mean in their operations, they become heavy to keep the 
company efficient. Individual subsystem optimization is thus leading to total system sub- 
optimization. 

Our thesis here is that a process-oriented approach to performance analysis and opti¬ 
mization is the correct one and all the ills mentioned above can be eliminated. Furthermore, 
demanding customers and global competition dictate that the entire chain, i.e., the supply 
chain or the order-to-deUvery process should be effective and not just a link like manu¬ 
facturing or marketing. In fact, future competition will be process based and it will be 
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the supply chain versus supply chain or new product development versus new product 
development, strategy versus strategy, etc. 

Also, present day manufacturing systems are strongly coupled, both in terms of material 
and information flows. All stake holders in the manufacturing enterprises are connected 
via internet and intranet and also share a common database. Inventories are made very 
low and customers (both internal and external) insist on just-in-time deliveries. Further, 
suppliers, distributors and customers have a fundamentally different and cooperative re¬ 
lationship called partnership for sharing information, finances and helping one another 
in resolving quality and technology problems. Decisions in location of various facilities, 
manufacturing plants, distribution centres etc. are now taken by considering all costs: 
supplier-manufacturing, customer-manufacturing, distribution, transportation and other 
infrastructure costs. One may find that typically manufacturing cost accounts for only 
20-30% of the total cost. These changes make the enterprise strongly coupled and any 
analysis has to be conducted for the entire monolithic system. 

As discussed earlier, we consider a manufacturing system as a bundle of business pro¬ 
cesses. The business processes are value delivery streams starting with a customer need 
and ending with customer satisfaction. We now define the performance measures for busi¬ 
ness processes. For ease of pr.esentation, we target all results from now on for the order 
to delivery and supply chain processes. The important performance measures of business 
processes are the following. 

(1) Lead time". The lead time of a business process is the interval between start and end 
of the process. It is the concept to market time in the case of the product development 
process and the clock time between placing an order to the delivery at the customer 
site in the case of ODP and start of raw material ordering till the final assembly reach¬ 
ing the retailer in case of the supply-chain process. Lead-time reduction by removing 
non-value adding activities, and effectively managing interfaces with suppliers, man¬ 
ufacturing, logistics and distributors is an important exercise. 

(2) Customer service: Delivery of customer-desired products at the right time, in the right 
place and in the right quantities every time is the goal of customer service. Customers 
could be external customers (or end users) or internal customers. 

(3) Dependability: This is the reliability of product delivery and is an operational issue. It 
measures the ability to manage disruptions, such as machine failures, worker absen¬ 
teeism, truck failure or supplier failures, rush orders etc. 

(4) Cost: Like the lead time, cost also provides rich insights into process problems and 
inefficiencies. Interface costs, margins, costs in negotiations, inspection etc. are a waste 
and provide avenues for cost-cutting strategies. 

(5) Quality: Quality is management of all the work processes so that they are on target 
with low variation. This is achieved through root-cause analysis of defects and failures 
and elimination of the causes of failures. 

(6) Flexibility: Ability to meet customer requirements under various environmental un¬ 
certainties in various dimensions such as delivery time, schedules, design and demand 
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(7) Capacity: This is the total output rate of the business process. All the work processes 
are to be balanced in capacity otherwise there will be bottlenecks and delays. Strategic 
alliances are common among various work process owners since vertical integration 
is almost impossible. A little amount of over-capacity to meet rush demands would 
improve operational measures. 

(8) Asset utilization: The assets in some business processes (like the supply chain process 
and the order-to-delivery process) such as manufacturing plants, warehouses, com¬ 
munications infrastructure, fleet of vehicles etc. are worth billions of dollars at times. 
Their utilization is an important issue. 

One could define other measures such as the throughput of the business process and 
the work-in-progress along the process. Also, product variety supplied and defects in the 
process are important measures which could be inferred from the above. We deal only with 
flexibility in this paper. 

7. Flexibility 

7.1 Anatomy of changes 

Literature defines flexibility as a system’s capability to cope effectively with a wide range of 
enviromnental changes and internal variations without deterioration in system performance 
in terms of cost, quality, lead time and on-time delivery. It is certainly a desirable virtue 
to possess for a manufacturing system in these times of global competition, turbulent 
changes and mass customization. It is certainly not a new concept and has been studied in 
the economic and organizational context. Earlier studies in flexible manufacturing were 
emphasizing flexibility in the context of the factory floor (Upton & McAfee 1994,1995). 
In this section, in contrast, we are concerned with the flexibility of the entire business 
process. 

Since a business process is an ordered set of work processes, it is essential that the work 
processes be flexible for the business process itself to be flexible. Flexibility enhances 
performance measures such as lead time, quality, and on-time delivery. It also allows the 
manufacturing system to cope with uncertainties such as those listed below. 

(i) Resource changes: Variations in the number of human and machine resources on a 
factory floor due to machine failures, absenteeism etc; transport breakdown in the 
logistic system; a rush order from a valued customer; supplier bankruptcy etc. are 
issues that arise routinely. Process management should be able to cope with these 
kinds of change. 

(ii) Design and demand changes in the product: The changes could be either planned 
or unplanned. Customer demand is random and this combined with inaccurate 
forecasting will cause uncertainty in the design and mix of products. Also, proac¬ 
tive introduction of new products to beat the competition will reflect as planned 
change. 

(iii) Technology changes: These could be continuous or discontinuous. Discontinuous 
technology changes such as the ones in the PC industrv and the hard disk drives are 
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difficult to cope with. The company should have the ability to predict and develop 
competences in new technologies for future product generations and also have the 
capability to evaluate the risk associated with new ventures. 

(iv) Socio-political changes: Deregulation of telecommunications, airlines, information 
networks etc. has a big impact on existing players. Liberalization of certain closed 
economies had the same impact. Legislations on health care had tremendous impact 
on hospital and health insurance systems. 

7.2 Coping with uncertainty 

We see that as we traverse from resource changes to design and demand changes in the prod¬ 
uct to socio-political changes, both the magnitude and the effect of change will increase. 
They can be classified as operational, tactical, and strategic changes and the flexibility 
strategies could be correspondingly named. The resource changes occur daily and appro¬ 
priate procedures have to be designed and built into the system. Events such as failure 
of a machine or a truck or a rush order occur at random times and places. Procedures, 
like those in hospitals and civil defense, should be evolved for all frequently occurring 
events. Design and demand changes occur, say monthly, and they have to be met through 
proper scheduling of orders. They are tactical in nature and involve suppliers, logistics 
and distributors. The technology changes are sporadic but occur in a predictable way for 
companies with learning capabilities. Some companies see the opportunity and lead the 
change in an offensive way. They are strategic in nature and involve proactive strategies 
for product development, technology adaptation, etc. Socio-political changes are outside 
their control but some companies turn these into opportunities as well. 

The flexibilities built into the enterprise should have the abilities to cope with these 
changes. These abilities are built into the enterprise via technology, procedures, control 
mechanisms such as scheduling, infonnation processing etc. It is important to analyze the 
changes that a system is subjected to, so that appropriate flexibility strategies could be 
designed and implemented. 

The best way to cope with uncertainty is to eliminate the sources of uncertainty. This may 
not always be possible but one can certainly reduce the amount of change. For example, 
customer surveys and monitoring point-of-sale information, redesigning product range to 
increase part-commonality and delaying final assembly until receipt of order, will reduce 
demand uncertainty; preventive maintenance, use of diagnostic expert systems, and built- 
in fault tolerance will reduce the down-time and increase the system availability; quick 
adoption of new technologies such as IT and their proper implementation will remove all 
non-value adding activities and cut costs and delivery time to a minimum. Then one can 
confidently face, in fact, lead price wars in the face of deregulation. 

At times, it would help eliminate the need for change by replacing a rough terrain 
with a smooth road rather than trying to cope with it. By providing a smooth road, and a 
constant environment for a vehicle, one avoided the need for flexible legs. By providing 
transfer mechanism in an automobile assembly, which can transfer the workpiece across 
machine tools, one has eliminated the need for sensory and manipulative human func¬ 
tions (Viswanadham & Narahari 1992). 
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Group technology cells, each focusing on a product family, streamlined the product 
flow on the factory floor thus eliminating the need for sophisticated material tracking and 
scheduling algorithms. 

A flexible manufacturing enterprise whose business processes can cope with all the 
above changes will be extremely complex, expensive and time-consuming to install. It 
requires redundancy in terms of excess capacity, space and time which again will increase 
the cost of the products. In an environment of fast-changing technologies, shrinking product 
life cycles, and changing customer attitudes, one should strike a balance between cost and 
time for implementation and hedge against near-term uncertainties. One has to recognize 
that each industry/firm has only a finite life and eventually has to open new factories for 
new products and phase out old ones. 

An analysis of the product line of the enterprise for the next five years using good 
forecasting techniques, is generally done to determine the uncertainty in demands and 
the appropriate manufacturing system (focused cellular manufacturing cells, flexible man¬ 
ufacturing system or flexible transfer line) is decided. The kind of flexibilities that are 
needed to meet the customer demand could also be determined at this stage. Methods such 
as strategic alliances, acquisitions, out-sourcing, subcontracting are used to augment the 
competences. Benchmarking is used to determine the best in class manufacturing practices 
and the performance gaps. 

Thus we see that a preliminary analysis of the uncertainties will lead to an appropriate 
manufacturing system configuration which can effectively cope with their changes induced 
by them. 

7.3 Flexibility and technology 

Information and communication technologies such as EDI, EFT, CAD, FMS, ASRS, 
groupware, local area networking, bar coding, internet, intranet are extensively used to 
speed up the process as well as to deliver better value to the customer. Automation tech¬ 
nologies such as FMS, ASRS, MHS are very expensive, others such as EDI and bar coding 
require process simplification and analysis before implementation. The communication 
technologies such as EDI, EFT etc. are integrative technologies and their added power 
comes from the fact that information collection, storage and processing could be done in 
real-time and transferred to all the stake holders. For example, order information, point of 
sale and inventory status could be monitored by all concerned such as distributors, manu¬ 
facturers, suppliers etc. This information can be gainfully employed to schedule production 
based on the needs of the customer. Advantages and use of all the technologies mentioned 
above is well documented in the literature (Corline & Essaides 1993; Hartley 1993). 

All the core business processes such as the supply chain process, order-to-delivery pro¬ 
cess, new product development process, factory floor process and logistics process are all 
influenced by the above technologies to improve the cycle time, quality and enhance the de¬ 
livery reliability (Hartley 1993). Quick changeovers, sharing of information (on customer 
order, product designs, inventory status, point of sale), supplier, logistics and distributor 
partnerships are enablers of high performance, low inventories and high flexibility (Corline 
& Essaides 1993; Srinivasan et al 1994). Customer desired product variety can be sup¬ 
plied without penalty on performance (Kekre & Srinivasan 19901. Low inventories result 



buy rigid or flexible technologies for a given demand forecast, economics and advantages 
of EDI, whether a supplier should adopt EDI, whether a customer should invest in a 
supplier’s infrastructure (production technologies, EDI, etc) (Wang & Seidmann 1995), 
vertical integration or virtual integration, whether a supplier should follow a leader model 
or follower model etc. (Srinivasan et al 1994; Wang & Seidmann 1995). 

Technology has made possible the sharing of various kinds of information and partner¬ 
ing. Virtual integration is an alternative to vertical integration and it provides flexibility 
to meet changing demands. In a changing scenario, whether to change the partnership or 
invest in enhancing the capabilities of existing suppliers is the big question. Much can be 
said on both sides. Also, choice of suppliers, distiibutors and transporters is an irreversible 
decision until the end of the contract. Changing the partners involves considerable time 
and effort and is comparable to the set-up time and cost. It is important to note that good 
partnerships will cut both cost and cycle time. A manufacturer can maintain the shelf space 
in a supermarket bearing the inventory, transportation and obsolescence costs in return for 
point-of-sale information (Venkataraman 1994). A distributor can maintain the inventory 
of hot spares and medical supplies to have big customers locked-in. Bennetton, Walmart, 
and Dupont have used technology to redesign their business processes for high flexibility in 
tenns of response to customers and high performance in terms of low cycle times (Corline 
& Essaides 1993; Hartley 1993). 

7.4 Flexibility in manufaturing enterprises 

It is common misunderstanding that flexibility is achieved through flexible machine or 
computer hardware acquisition. The truth is far from that. Indeed several companies in¬ 
curred losses, let alone breaking even, because productivity reduced with introduction of 
new manufacturing hardware. Process variety complicates the parts supply and assem¬ 
bly process because more parts require a greater coordination to get the right part into 
the worker’s hands at the exact instant the guided vehicle brings the car to the worker’s 
station. Because of the complexity induced by variety, many companies view flexibility 
management as a necessary evil (Sethi & Sethi 1990). 

Manufacturing enterprises increasingly look like restaurant chains. Customers place 
their orders, waiters transmit the specification to the kitchen and a team of cooks assemble 
the product. The products are designed already and part programs are available, as soon 
as customers order from a table look-up, products are scheduled and delivered. This kind 
of flexibility is static flexibility or product-centric view of flexibility. On the other hand, 
dynamic flexibility is creation of capability to act in response to opportunities as they arise 
over time. Competences to developing new designs and new products to manufacturing 
customer desired products and to deliver them faster than competition are tenets of dynamic 
flexibility (Macduffie etal 1996). 

There are four basic types of flexibility: mix, volume, new-product and delivery time flex¬ 
ibilities. See table 1. Each is important in a different environment (Suarez et al 1996). These 
flexibilities are implemented through a variety of factors such as production technology. 
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Table 1. Types of flexibility. 


Mix Flexibility 


Volume Flexibility 


New Product Flexibility 


Ability of a system to simultaneously produce a number of different 
products in a given period. 

Ability of a system to change significantly the production level and the 
composition of the product mix in a short time span. 

Ability of a system to add or substitute new products to the product mix 
over time. 


Delivery Time Flexibility Ability of a system to reduce the order-to-delivery time 


product management techniques, relationship with suppliers and distributors, human re¬ 
source management and product design. It is important to realize that different types of 
flexibilities are important in different competitive situations. For example, mix flexibility 
is important when a firm has a broad product line and caters to different market segments. 
There are several ways of achieving each type of flexibility. Mix flexibility may be achieved 
through skilled workers or programmable equipment. New product flexibility is needed in 
technology intensive markets. Volume flexibility is important in volatile markets. 

Flexibility management is a competence that involves skillfully managing several re¬ 
sources of the system including automation hardware, software, people, organization struc¬ 
ture, suppliers, customers, distribution channels, and factory floor control systems. It is an 
integrity-related competence and involves collective learning in the organization, coordi¬ 
nating diverse production skills and integrating multiple streams of technologies. It is a 
capability for deploying various resources of the company using the organizational pro¬ 
cesses to efficiently and economically produce a wide variety of part types. We elaborate 
this point with respect to the relationship with suppliers and distributors. 

A formal relationship between suppliers and distributors is essential for a positively 
correlated flexibility productivity relationship. First, the capability of a manufacturer to 
offer a rich variety of products is dependent on the supplier’s capability to produce a variety 
of component parts, i.e. the supplier’s flexibility in several dimensions: delivery time, mix, 
volume and new products. If components for each product in the mix are sourced from 
different suppliers, then the management overhead increases enormously. Secondly, when a 
plant has machine failure problems or when there are sudden volume surges time-sensitive 
orders can be sub-contracted to dependable contractors. 

The ability of the manufacturing system to produce a variety of products should be 
matched with the ability of the distributor to pass the variety on to the customer by proper 
advertising and maintaining appropriate inventory levels. Also, distributors by their close 
interaction with customers can easily identify their true needs and preferences so that 
the company can produce what sells rather than trying to sell what is produced, thus 
minimizing “market defects”, i.e. producing things that customers do not want. One cannot 
underestimate the influence of distributors in variety management, both in information 
collection and also in sales. Further, the use of information technology tools such as 
electronic data interchange, electronic funds transfer and customer sales tracking systems 
by dealers and suppliers enhance the delivery time flexibility. 

From the above discussion it is clear that flexibility needs to be defined, designed and 
created for end-to-end business processes. 
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8. Flexibility in business processes 
DEFINITION 4 

A business process is flexible if it can effectively manage or react to change with little 
penalty in time, cost, quality, or performance. 

This is of course a very general and abstract definition. Our discussion in this section 
will concentrate on issues concerning any business process; the specific concerns of new 
product development, supply chain and order-to-delivery process will be dealt with as 
examples. As we saw earlier, a business process is an ordered set of work processes and 
interfaces (both functional and organizational). The interfaces are generally managerial 
processes such as sharing information, rules and procedures etc., depending on relation¬ 
ships with suppliers, distributors, customers etc. Work processes typically involve tasks 
such as design, manufacturing, marketing, sales etc. It is essential that the interfaces are 
smoothened out and work processes are flexible for the entire business process to be 
flexible. Interfaces between functions and organizations are like setups which have to be 
reduced by continuous improvement. 

Previous studies on flexibility have so far been confined to factory floor technology, 
setup times, WIP, and so on. The volume and mix flexibilities are defined with the fac¬ 
tory floor in mind. Here we talk of the entire business process, as flexibility of some of 
these processes such as customer acquisition may not include the factory floor. Flexibility 
in human resources, infrastmcture flexibility and flexible management stmctures are all 
examples of flexible processes beyond the factory floor. 

Now we consider SCP and ODP and discuss flexibility issues in these processes. 

8.1 Supply chain process 
DEFINITION 5 

A flexible supply chain process is one that responds effectively to changes in volume, 
product mix, delivery times and delivery routes without deterioration in cost, quality and 
lead time. 

It is essential all subsystems be flexible for the supply chain process to be flexible. 
Flexibility management is a capability that has to be built up over time through the use 
of skilled work force, automated equipment, IT tools, computer control systems, bench 
marking and implementing the best practice and the like. 

We will first discuss the various types of flexibilities for the supply chain. Essentially 
flexible supply chains accommodate special customer requirements, provide customized 
service, allow product modification while the order is in process, introduce new design 
features and so on. 

8.1a Volume flexibility in supply chain process: 

DEFINITIONS.! 

A supply chain is volume flexible if a customer order with different product mixes and 
volume levels can be processed for rapid delivery. 
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It is essential that small batches of products are produced and delivered for a system to 
be volume-flexible. This implies that setup times are small all along the process. This is 
because the economic batch size in any work process (manufacturing or transport) is an 
integral multiple of its successor. 

8.1b Mix flexibility in supply chain process: 

DEFINITION 5.2 

A supply chain is mix flexible if the system can produce a number of products simultane¬ 
ously and deliver them to the customers. 

This capability indicates the breadth of the product line and ability for quick change- 
overs. The suppliers are either mix-flexible or there are more number of suppliers. Also 
warehouse and transportation should be able to handle different sizes, shapes and instal¬ 
lation procedures (multi-skilled labour). 

Excessive product variety induces several problems in both performance and manage¬ 
ment. The system complexity increases with a greater number of suppliers (at least two or 
more for each component), since establishing partnerships, sharing information, helping 
in quahty control, reducing changeover times etc. are all time- and effort-consuming pro¬ 
cesses. Thus variety means more design, more production planning and control, and more 
forecasting and more leftovers. While no one can disagree that one should have variety, 
it is necessary to find and manufacture the twenty percent of the products that win in the 
market. 

8.1c Routing flexibility in supply chain process: 

DEFINITION 5.3 

This is the ability of the supply chain to produce and deliver to the customer through 
alternate routes or, equivalently, each function (manufacturer, warehousing, transporting) 
could be performed in more than one location. 

Routes to supply equipment or to fill in the order can be ordinarily fixed but can be 
changed in the event of problems such as breakdown. Routing flexibility is generally 
obtained by duplicating each function in various locations, having over-capacity and re¬ 
dundancy in transportation, and efficient scheduling and control software. The average 
number of possible ways in which an order can be filled could be used as a possible rout¬ 
ing flexibility measure. For example, an order for a workstation from an Indian customer, 
can be filled either from Singapore or Europe or USA in a variety of routes. Depending on 
the time available, it is sent by air freight or by ship. 

8. Id Delivery time flexibility in supply chain process: 

DEFINITION 5.4 

A supply chain process is delivery time flexible if it can reduce or expand the delivery time 
as per customer requirements. 
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Here again, rush orders and delayed shipment requests are common from customers. 
Ability to reschedule the orders all along the supply chain, low variation of the lead times 
of all the work processes, quick change-over times, excess capacity in all resources, are 
some of the requirements for delivery time flexibility. 

8.1e New product flexibility in supply chain process: 

DEFINITION 5.5 

A supply chain process is new product flexible if generations of several new products can 
be rapidly designed and marketed simultaneously. 

This is a very important flexibility for most manufacturing companies. A cross-functional 
team with a process owner manages the product development process with reviews and 
testing at intermediate points in the process to maintain design quality. The team has 
members from among suppliers, distributors and at times even customers, apart from 
having functions within the organization, so that the designed products are manufac¬ 
turable, saleable, and satisfy customer specifications. Time-to-market is critical for this 
process to gain the advantages of first mover (Ulrich & Eppinger 1995). Patenting, nav¬ 
igating through regulatory agencies, production, marketing etc. augment product devel¬ 
opment capability. For example, pharmaceutical companies are most R & D intensive 
and the high cost of drug production mandates introduction of the drug worldwide to 
make the most from the effort. Thus a company should not only have capability to inno¬ 
vate new drugs but also have downstream capabilities to navigate it through regulatory 
mechanisms, and for manufacturing, marketing, and distribution in domestic and foreign 
markets. 

8.2 Flexibility in order-to-delivery process 

We now consider the flexibilities connected with order-to-delivery process. 

DEFINITION 6 

An order-to-delivery process (ODP) is volume-flexible if a number of customer orders 
with different product mix and volume levels can be simultaneously processed for rapid 
delivery. 

Mix, routing and delivery time flexibilities of an ODP can be similarly defined. Notice 
that volume flexibility of the ODP requires flexible order processing, and a flexible supply 
chain process with flexible manufacturing, logistics chain, marketing channels, etc. Also 
all these subsystems need not be within the boundary of a single firm, they could be part of 
a supplier-manufacturer-distributor value chain. For example, a catalog store has an ODP 
that is mix- and volume-flexible if it can reliably coordinate the supply chain. Similarly 
a car dealer is mix-flexible if he can arrange for delivery of any customer-desired car as 
quickly as possible. An ODP is routing-flexible if it has redundant suppliers, manufacturers, 
distribution etc. in various locations. Table 2 gives some measures of the flexibility for an 
ODP. 
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Table 2. Flexibility measures for ODP. 

Type Measures 

Mix flexibility • Number of different products that can be supplied. 

• Optimistic changeover times and costs among different products (function of 
scheduling) 

Volume flexibility • Stability of cost of delivery over varying levels of production volumes. 

• Smallest profitable volumes of operation. 

Routing flexibility • Average number of ways in which a product can be ordered, manufactured, 
and delivered. 

• Average delay due to subsystem failures. 


Three aspects of ODP flexibility - Flexibility is the response of a firm or a group of firms 
to uncertainty. We have identified three different types of changes: short-term, mid-term 
and long-term. Business processes and all systems involved in it have to manage these 
changes effectively. The built-in hardware, software and management procedures should 
be able to counter the variations or changes. We identify operational, tactical and strategic 
flexibilities in the ODP context. 


8.2a Operational flexibility'. Short-term changes occur very frequently, may be every 
few hours to every day. We assume that the organizational and functional interfaces are ail 
smoothened out, i.e. coordination of deliveries with suppliers, collaborative arrangement 
for distribution, are all in place. A business process is operationally flexible if it permits a 
high degree of monitoring and control to accommodate short-term changes. 

Basically operationally flexible firms have the list of possible contingencies that can 
occur or have previously occurred and a list of methods to deal with the situations. Like 
the emergence of procedures in civil, defence or medical fields, the approach taken can be 
reactive or for eliminating the root cause of the contingency. In either case the customer’s 
problems are resolved. A firm should have the formal procedure in place to automatically 
accommodate routine or anticipated events occurring at random points in time, with staff 
empowered to make decisions. Encouraging creative response by staff will help in opera¬ 
tional flexibility of processes such as ODP which are characterized by massive detail over 
a large geographical area. Use of outside service providers and strategic partners is also 
common. Some examples include: 

(1) withdrawal of a the drug from the market because of possible contamination in a batch; 

(2) failure of earth-moving equipment in a remote location, for which spares need to be 
delivered as soon as possible. 

8.2b Tactical flexibility: An important characteristic of tactically flexible processes is 
the ability to switch quickly and cheaply between products. Real time data collection 
at point of sale, networked organization stracture, EDI interconnection, bar coding etc. 
are important ingredients of the system. Good implementations include quick response 
manufacturing by Dupont and desktop publishing facilities by McGraw-Hill. Basically, in 
case of tactical flexibility, we are concerned with delivering known products to customers 
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subject to the required delivery schedule (both in quantity and time). Reduced setup times 
and costs at various machines in the suppliers’ and host’s manufacturing plants, fast com¬ 
munications, say through EDI, among all stake-holders, reducing transport costs either by 
using small trucks or by sharing the truck among different suppliers, packaging products 
in a form directly usable by the customer, having surplus capacity in the manufacturing 
and transport systems to take care of breakdowns in machines and transport, fault-tolerant 
control system to manage the system, are some of the attributes of such systems. Benet¬ 
ton, TESCO, and Walmart are examples of successful tactically flexible order-to-delivery 
business processes. Similarly, custom-made book publishing by McGraw-Hill is also an 
example of tactical flexibility in various business processes. In all these cases the changes 
are in the demands of the customer. The response of the company or group of companies 
to provide the range of products has to be quick. This is achieved through strategic net¬ 
working and communication of suppliers and customers in case of Benetton, TESCO and 
K-Mart. 


8.2c Strategic flexibility: This involves development of competence in technologies to 
lead the industry in all work processes and innovation in managing/eliminating interfaces. 
It is concerned with the ability to make good use of undisclosed opportunities, either 
through the production process or the product. It is the ability to respond to unmeasurable 
changes in the market conditions and unprogrammed advances in technology. 

Any type of flexibility arises from two sources: from assets (tangible or intangible 
objects useful in product delivery) and organization or coordination of these assets to gain 
competitive advantage. Strategic flexibility is the ability to deploy the assets and capabilities 
to create, produce and offer new products to markets in a changing environment, dynamic 
markets and shifting technologies. 


9. Product structure and flexibility in supply chains 

Performance and flexibility issues in supply chain and order-to-delivery processes are 
intimately related to the product structure and also to the logistics network organization. 
Modular designs with customization occurring as late as possible in the production are 
preferred. Also manufacturing facilities are staged with customization occurring with the 
help of local suppliers to meet local language, power, and communication standards. The 
combined product and logistic network design yields the following structures (Williams 
1981; Macbeth & Ferguson 1994): 

(i) A straight-line interconnection of plants, each adding modules to the semi-finished 
product sent by the previous plant and passing it on to the successor. The product 
variety is limited, the material flow unidirectional and the decisions are infrequently 
made as to when to stop the line for switchover to another product type or how large the 
batch size should be. Mass production and continuous manufacturing are examples 
of this type. Batch sizes are large in this case. 
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Figure 9. Product (manufacturing plant) structure with late customization. 


the customer end or at a local manufacturing plant or distribution centre. The structure 
of the plant looks similar to the one shown in figure 9. 

Personal computers, IC chips, disk drives, laser printers and other electronic equip¬ 
ment follow this pattern. Here a product variety can be obtained through dedicated 
plants to perform the last stages of manufacturing. Also, modular design of the prod¬ 
uct keeping in view the logistic costs will reduce the total logistic costs as well as 
help in effective supply chain management. The individual customer demands could 
be highly variable, but the total semi-finished product demand variability is low, 
Hence it makes lot of sense to maintain inventories at the subassembly level whicli 
are customized based on individual demands and thus maintain low lead times (Lee 
& Feitzinger 1996). 

The various stages of manufacturing could be dispersed across continents and 
transportation is often on the high seas. Thus the economic transportation batch size 
is larger than in closely located plants. This dictates that the production batch size 
of the predecessor plants also be high. Thus batch sizes and inventories are high and 
product variety is low in all the first (n ~ 1) stages. The final stage, however, has 
dedicated low volume plants. 

In the plant structure shown in figure 9, uncertainties in customer demands and 
long transportation times are basically met through staged manufacturing plants with 
dedicated technology, inventories, and large-to-medium batch sizes. Supplier man¬ 
agement, partnership with logistics agents, modular product designs are some of the 
enablers for good flexibility management in this case. 

(iii) We consider the third case of plants where customization starts early in the productior 
stages and which have a diverging architecture (see figure 10). Starting with a limited 
number of raw materials, a wide variety of finished products are produced. Sucl 
examples can be found in electro-mechanical systems such as motors, textiles, meta] 
fabrication, and chemicals. 

The enterprise is very complex from a managerial viewpoint. Several plants 
with similar manufacturing capabilities have to be managed, and have to maintain 
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Stages: 12 3 4 

Figure 10. Product flow and manufacturing plant structure with early customization. 


partnership relations with several suppliers, distributors, and customers. The plants 
could be staged or can be at central locations and made flexible using automated 
technologies. The batch sizes are typically low, and lead times are to be small. Low 
inventories are to be maintained for cost competitiveness. Integration using EDI and 
bar coding, and scheduling customer orders using point-of-sale information would 
increase the effectiveness of the supply chain. The mix flexibility for the supply chain 
is very high. Volume fluctuations are handled through large number of suppliers and 
quick feedback for point-of-sale information. The order has to travel through lots of 
organizations and the interfaces are generally not managed; instead high inventories 
are maintained. 

(iv) The final architecture is assembly-oriented with converging architecture. One finds 
such patterns in the manufacture of aircraft and in construction. Numerous raw ma¬ 
terials are transformed into sub-assemblies and finally into a huge assembly (see 
figure 11). Large number of mid-volume components are produced at various stages 
of manufacturing. Components are globally sourced from a variety of suppliers. The 
management is complex. Here again supplier management, flexible manufacturing, 
and information integration would enhance the enterprise’s performance. Small batch 
sizes, sometimes one of a kind, and low inventories are typical here. The supply chain 
is definitely mix-flexible, for e.g., aircraft manufacturers like Boeing have wide-body, 
narrow-body aircraft and the 777’s. The product mix gets smaller as the structure con¬ 
verges, high at the components level and low at the final assembly level. Scheduling 
to deliver products on-time is an important issue in both the construction and the 
aircraft industry. 
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Stage 1 2 3 4 - 5 


Figure 11. Converging product (manufacturing plant) structure. 

We have described a few typical architectures and used these as vehicles to discuss 
the flexibility issue. Our intention here is to point out the fact that flexibility, as the final 
user sees it in terms of variety, can be obtained in several ways depending on the product 
structure. 


10. Conclusions 

In this paper, we attempted a complete definition of a manufacturing system as a set of 
integrated activities involving suppliers, production, distributors, customers, competition, 
logistics etc. and emphasized the need for coordination among these activities via man¬ 
agement of end-to-end business processes. We also brought out the fact that a preliminary 
analysis of the uncertainties that a manufacturing system faces, and the subsequent design 
of the system for flexibility and also the design of control systems to reduce the effects of 
these uncertainties on system performance are essential for achieving sustainable compe¬ 
titive advantage. We defined the performance metrics for business processes. 

We have defined flexibility of business processes and presented definitions and measures 
for important processes such as product development, supply-chain and order-to-delivery 1 
processes. Finally, the discussion presented here is equally valid for continuous and chem- f 
ical manufacturing systems as well. Except for a manufacturing plant that has continuous t 
dynamics, the rest of the system is identical to the one discussed here. Even in continuous 
processes, there is too much attention paid to the manufacturing process and less to lo¬ 
gistics and the interface activities between various functions and organizations. Attractive 
gains can be made using IT and process management techniques in reducing the cycle time ; 
and improving quality in these processes. i 


We would like to acknowledge several colleagues for the time and effort they had spent 
in discussions and in refining the manuscript. Special thanks are due to Prof. Y Narahari, 
who spent enormous time in discussions on business processes, their management and 
performance analysis. 
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Abstract. The first-wave of reengineering, during the first half of the nineties, 
focused on making organizational changes and used primarily information mod¬ 
els of supply chains to integrate business processes. Quantitative models are ex¬ 
pected to have a significant impact in the second-wave of reengineering through 
the deployment of performance and optimization models, economic analysis, 
and decision support systems. In this paper, we focus on the vital role that quan¬ 
titative modelling techniques such as those founded in Operations Research and 
Industrial Engineering can play in reengineering supply chains. These quantita¬ 
tive models can extend the business process reengineering concepts to provide a 
concurrent reengineering framework for modelling the supply chain processes, 
identifying reengineering opportunities, evaluating design alternatives, guid¬ 
ing the selection of the best alternative, and deploying tools to implement the 
design. We illustrate such use by surveying current industrial practice and in¬ 
troducing real world examples based on our practical experience in solving 
supply chain and reengineering problems. 

Keywords. Business process reengineering; supply chain management; quan¬ 
titative models. 

1. Introduction 

The commercial potential of a global market as well as the significant advances in informa¬ 
tion and communication technologies is driving companies world-wide to reengineer their 
deployed assets and distributed capabilities to form manufacturing supply chains that offer 
significant business advantages. A manufacturing supply chain is an integrative approach 
to managing the inter-related flows of products and information among suppliers, man¬ 
ufacturers, distributors, retailers, and customers. It consists of cooperative inter-linked 
processes that transcend production, distribution, and transportation functions and per¬ 
mit the coordination of strategies, tactical plans, and operations across these functions. 
Supply chains are a major departure from the traditional ways in which businesses have 
been organized, i.e., vertically integrated along functional lines with significant built-in 
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Constituting manufacturing supply chains therefore requires major structural changes 
in the way companies are organized to produce and distribute goods and services. A widely 
adopted approach, with mixed success, in making these changes is business process reengi¬ 
neering (BPR). In BPR, like in manufacturing supply chains, the business process is viewed 
as a horizontal flow of activities. There is a wide range of opinion on what constitutes BPR. 
Hammer & Champy (1993) define BPR as a breakaway from old rules, adopting a radi¬ 
cal approach to changing business. Hammer and Champy’s approach involves four basic 
stages in BPR, namely, mobilization of a team, diagnosis to identify weaknesses in exist¬ 
ing design, redesign to create breakthroughs, and transition to roll out and institutionalize 
the results. Harrington (1992) on the other extreme views BPR as an incremental process 
improvement approach. Aikins (1993) regards BPR as a redesign process that leverages 
the potential of information technology. 

Manufacturing supply chains and BPR emerged as independent concepts in the mid¬ 
eighties and early nineties respectively and as such are well documented. In this paper, 
we focus on the vital role that quantitative modelling techniques such as those founded in 
Operations Research (OR) and Industrial Engineering (IE) can play in reengineering supply 
chains. These quantitative models can extend the BPR concepts to provide a framework for 
modelling the supply chain processes, identifying reengineering opportunities, evaluating 
design alternatives, guiding the selection of the best alternative, and deploying tools to 
implement the design. We will illustrate this framework by introducing real world examples 
based on our practical experience in solving supply chain and reengineering problems. 

The paper is organized in five sections. Following the introduction, the interest in sup¬ 
ply chains and the motivation for modelling supply chains is reviewed in § 2. Section 3 
looks at the role and impact of quantitative modelling in reengineering. Practical examples 
illustrating the use of quantitative models in reengineering ai'e presented in § 4. Finally, 
§ 5 offers some conclusions. 



p 


2. Why model supply chains? ^ 

Consider for example a large manufacturing company that caters to a global market and has j 
assets deployed world-wide. A scenario analysis based on a typical consolidated financial ; 
summary is shown in figure 1 to illustrate the performance expectations that motivate supply | 
chain management. Supply chains impact the five basic variables that are commonly used ’ 
in measuring business performance, namely, sales, cost of goods sold, expenses, inventory, 
and accounts receivable. i 

As is clear from figure 1, the profitability and efficiency of the enterprise can be improved ! 
dramatically if gains can be realized along multiple performance dimensions (modelled by ; 
progressively changing the basic variables in the scenario analysis). Supply chains by virtue 1 
of their comprehensive enterprise-level focus have the potential to realize such concurrent , 
performance. Therefore, from a business standpoint supply chains merit attention and [ 
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Impact Variables 

1 

2 

3 

4 

5 

Sales 

1.00 

1.00 

1.02 

1.02 

1.02 

Cost of Goods Sold 

1.00 

1.00 

1.00 

1.00 

0.99 

Expenses 

1.00 

1.00 

1.00 

1.00 

0.99 

Inventory 

1.00 

0.90 

0.90 

0.90 

0.90 

Accounts Receivable 

1.00 

1.00 

1.00 

0.95 

0.95 

Profitability 

Return on Equity 

18.36% 

18.36% 

24.77% 

24.77% 

27.64% 

Return on Net Assets 

15.61% 

15.81% 

18.81% 

19.28% 

20.65% 

Return on Sales 

10.55% 

10.55% 

12.31% 

12.31% 

13.18% 

Net Profit Margin 

5.72% 

5.72% 

7.57% 

7.57% 

8.45% 

Efficiency 

Inventory Turns 

6.57 

7.31 

7.31 

7.31 

7.23 

A/R turns 

3.07 

3.07 

3.14 

3.30 

3.30 

Total Asset Turnover 

0.90 

0.90 

0.92 

0.93 

0.93 

Net Asset Turnover 

1.48 

1.50 

1.53 

1.57 

1.57 


Notes: 

• Scenario 1 represents the base case or the current performance, 

• Scenario 2 represents a 10% reduction in inventory costs from the base case, 

• Scenario 3 adds a 2% gain in sales revenues over the base case to scenario 2 and so on. 

• The profitability and efficiency measures are based on standard financial management definitions. 

Figure 1. Scenario analysis illustrating potential gains from supply chains. 


capabilities to produce and deliver is equally if not more important than the products 
themselves. Consequently, in companies that are market leaders, the building blocks of 
corporate strategy are not products and markets but business processes that constitute 
supply chains (Stalk et al 1992). At the same time reengineering business processes to 
form supply chains, to realize the performance projected in figure 1, introduces formidable 
challenges. 

The nonlinear impact of the individual performance dimensions on overall profitabil¬ 
ity and efficiency is readily apparent from the analysis presented in figure 1. In fact, this 
nonlinearity is inherent in the complex and conflicting trade-off that characterizes the op¬ 
erations of supply chains. The large-scale physical production and distribution network 
for material flow, the uncertainties associated with the external customer and supplier in¬ 
terfaces, and the myriad cross-functional and nonlinear dynamics associated with internal 
information flows, are some of the factors that contribute to the supply chain complexity. 
Lee & Billington (1992) have studied supply chains in electronics, computer, and automo¬ 
bile industries and identified a number of pitfalls that companies face due to the complexity 
of supply chains. Some of these include the lack of supply chain performance metrics, sim¬ 
plistic inventory policies, ignoring the impact of uncertainties, poor coordination among 
divisions constituting the supply chain, and inadequate consideration of inventory and re¬ 
sponse time factors in economic analysis. They note the need to develop an understanding 
of the supply chain complexities to avoid these pitfalls. 

Modelling is fundamental to understanding supply chain complexities and putting the 
insights to practical use in supporting and guiding business process reengineering. Cypress 
(1994) has noted that the first-wave of reengineering (during the first half of the nineties) 
focused on making organizational changes and used primarily information models of supply 
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chains to integrate business processes. In this paper we ai*e interested in the quantitative 
models of supply chains which are likely to have a significant impact in the second-wave of 
reengineering through the deployment of performance and optimization models, economic 
analysis, and decision support systems (Slats et al 1995). From here on modelling refers 
to quantitative modelling. f 

One of the earliest efforts of modelling a supply chain can be traced to Forrester (1958) ! 

who developed a simulation model of a production-distribution system to study its time- 
dependent dynamic behaviour. Another early work by Hanssmann (1959) described an an¬ 
alytical model to determine optimal inventory levels in a manufacturing system composed 
of material procurement, production, and distribution elements. Cohen & Lee (1988) were 
among the first to propose an analytical modelling framework to evaluate the performance 
of supply chains spanning functions from raw material procurement to finished goods de¬ 
livery to customers. They have applied this framework to supply chains of automobile spare 
parts and personal computers (Cohen & Lee 1990). Pyke & Cohen (1990) developed a gen¬ 
eralized Markov-chain model of a multi-product, three-echelon, production-distribution 
network and studied its performance. 

A recent review of OR models in supply chains (Slats et al 1995) notes that most of f 
the quantitative models that are reported in the literature are based on optimization and 
simulation. They observe that utilization of OR techniques and models to analyse the per- ' 
formance of the overall supply chain remain uncommon. The authors conclude that realistic j 

OR models can make considerable contiibution in diagnosing and redesigning business 
processes and facilitate the integration of supply chains. We concur with this observation 
based on our experience of using OR/IE models in reengineering supply chains. i 

One such model was for a large electrical component manufacturer (with an annual i 
revenue of $700 M) who operated 10 final assembly plants throughout the US. Some 
key components were manufactured in-house in two plants in North America. Other raw 
materials were purchased. The 3000 or so active products were then distributed through a 
two-echelon warehouse system, shared by several distribution channels (with each channel 
focusing on a particular market segment). There were also products imported from overseas ^ 

plants owned by the company. The lower echelon alone had five distribution centres serving 
the US market, with a small amount of export. This physical supply chain was managed 
by traditional business functions, such as forecasting, inventory management, production 
planning, transportation management etc., using a commercial supply chain management 
system augmented by in-house developed software. This scenario is fairly representative 
of discrete product manufacturers except for speciality suppliers. The performance model 
for this supply chain is described in § 4.1. 


3. Quantitative modelling and reengineering 

BPR in a broad sense has come to signify the following: 

9 development of a shared understanding of the business processes, 

• streamlining and integrating business functions both internal and external to the orga¬ 
nization. and 
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to manage and support planning, sourcing, making, and delivering activities that constitute 
a supply chain. In practice, companies have taken different tracks to reengineering supply 
chains that will offer high value at low cost. Some companies have concentrated on making 
process improvements v/hile others have focused on upgrading and installing information 
systems. The results on the reengineering activities have been mixed. Gartner Group esti¬ 
mates that nearly 70% of typical BPR efforts fail to achieve the projected results. Although 
BPR efforts hinge on several factors for success, one vital factor is the availability of tools 
and an approach that allows BPR practitioners to effectively communicate among them¬ 
selves as well as with IT professionals who are charged with implementing BPR related 
systems work. We believe that quantitative models and tools can be very effective in this 
context. 

Despite the high visibility and cost of BPR efforts there is a lack of tools to support the 
reengineering activities (Childe et al 1994). A recent survey of the BPR tool market reveals 
that there are slightly over 50 or so programs widely ranging in price and functionality from 
$600 for flowcharting software to $15 K for simulation modelling packages (Barrett 1996; 
Rovira 1996). These commercially available tools can be grouped into three categories. 

(1) Information System departments in organizations have been traditionally using sys¬ 
tem development tools in BPR. These tools are capable of documenting processes 
qualitatively in terms of structured functional and input/output diagrams. The oldest 
and most widely used tool in this categoi 7 is based on the IDEF (Integrated Defini¬ 
tion for Function modelling) methodology that was developed by the US Air Force 
in the seventies for its integrated computer-aided manufacturing program [URL 1]*. 
Extensions to the IDEF methodology have been developed for enteiprise modelling 
(Malhotra & Jayaraman 1992). An example of commercial implementation of the 
IDEF methodology is WizdomWorks [URL 2]. 

(2) Groupware and computer-aided software engineering (CASE) vendors are incorporat¬ 
ing graphical engines for business process modelling. The groupware tools have the 
ability to check consistencies in the process and workflow descriptions while the CASE 
tools provide an object-oriented representation of the business rules and even auto¬ 
matically generate codes for some software implementations. Although these tools are 
clearly an advancement over IDEF which is based on structured analysis and design 
they are only capable of qualitatively capturing processes. A list of workflow-based 
tools for BPR can be seen in [URL 3]. An example of workflow based tool is Business 
Process Modeler [URL 4] and an example of CASE based tool is ERwin/ERX for 
Teamwork [URL 5]. 

(3) At the high-end of BPR tools are discrete event simulation packages that provide 
tailored BPR modelling capability. These tools go well beyond describing the pro¬ 
cess and provide a means to quantitatively model supply chains and evaluate perfor¬ 
mance (Connors et al 1995). These tools include BPMAT [URL 6] developed by IBM, 
ARENA [URL 7] by Systems Modeling Corporation, and Extend -t- BPR [URL 8] by 
Imagine That Inc. The use of Extend 4- BPR is illushated through the well-known 
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and Hansen (1994). Other general purpose simulation packages have also advertised 
BPR modelling capabilities. 


A general list of BPR tools can be seen in [URL 9]. BPR tools with qualitative pro¬ 
cess modelling capabilities are inadequate in reengineering supply chains. The process 
descriptions such as those generated by IDEF are static functional snapshots and are very 
limited in articulating the existing process. Besides, these tools completely fail to capture 
the process dynamics and thereby ignore vast amounts of process knowledge. On the other 
hand, BPR tools with quantitative modelling capabilities can more precisely model pro¬ 
cesses and provide an effective means for visualizing, benchmarking, and articulating the 
BPR results. The role of quantitative modelling can best be appreciated when viewed in 
the context of the BPR approach. 

Figure 2 shows the classical BPR approach (Hammer & Champy 1993). The BPR ap¬ 
proach has four basic stages: mobilization to form a reengineering team and generate a pro¬ 
cess map, diagnosis to develop an AS-IS understanding of the processes and identify areas 
for improvement, redesign to generate process breakthroughs and identify best design from 
several alternatives, and transition to pilot the TO-BE implementation and eventually roll 
out the results. These four steps are typically executed in sequence although there are sig¬ 
nificant iterative dependencies in the diagnosis and redesign stages. One reason for the se¬ 
quential execution of the diagnosis and redesign stages is that most of the existing BPR tools 
that can qualitatively document processes do not support concurrency in these BPR stages. 

In our view quantitative modelling can have an impact in the diagnosis, redesign, and 
transition stages of BPR. Further, quantitative modelling can introduce concurrency in 
the diagnosis and redesign stages as shown in the so-called “concurrent reengineering” 
approach in figure 3. As indicated at the outset, quantitative modelling can serve as a 
compelling and useful framework in documenting the supply chain processes, identifying 
reengineering opportunities, evaluating design alternatives, guiding the selection of the 
best alternative, and deploying tools to implement the design. 

We have encountered in practice many common perceptions among management and 
staff within a company that have impeded the use of quantitative models. One such percep¬ 
tion is that reengineering is a business problem and that quantitative models are not really 
applicable. Business problems are often founded on complex trade-offs. Quantitative mod¬ 
els are particularly suited in identifying the factors that affect the trade-off, ascertaining 
the assumptions, and objectively framing the issues. Another common perception and an 
impediment to the use of quantitative models is that they are difficult to build and com¬ 
prehend and are not process-oriented. With advances in quantitative modelling techniques 
(heuristics, simulation, and optimization), the availability of a number of rapid modelling 
tools, and easy access to powerful computing platforms it is possible to build large scale 
models quickly and animate/visualize them for easier comprehension. This is reinforced 
by examples in § 4. 



I 


i 



4. Use cases of quantitative models in reengineering 


In this section we illustrate the use nf nnarititfltive mnrlelQ in the rtincrnncic reHpQicrn 
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Figure 2. Sketch of classical BPR 
approach. 
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Figure 3. Sketch of concurrent reengineering approach. 
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ntified above, through some use cases. Section 4.1 samples the current state of industrial 
ctice in this area, while §§ 4.2 to 4.4 detail some of our practical experience in applying 
mtitative models. 

Overview of state of industrial practice 

Dply chains are usually complex, and it may not be obvious from the symptoms where to 
pn in terms of rethinking and redesigning the operations. Cypress (1994) observed that 
ny reengineering projects are not successful because of the chosen scope of the project 
df. If one had unlimited resources, the ideal approach would be to start from a blank 
;et and design the business from scratch (Hammer & Champy 1993). More commonly, 
ngineering projects have to be carried out with limited resources of budget and time. It 
ilso very likely that reengineering efforts are initiated when business is not performing 
isfactorily, placing severe constraints on the resources available for reengineering. It is 
refore prudent to diagnose the supply chain quickly and accurately, so that resources 
\ be targeted to areas that will yield the most improvement in overall performance. 

\n early example of using quantitative tools for diagnosing production-distribution 
terns is described in Hax et al (1980). In this study, the production and distribution 
nning, sales forecasting, and inventory management systems of a consumer package 
Dds manufacturer were analyzed. First, a general data analysis was performed to describe 
supply chain (such as what fraction of sales is generated where, and what products 
contributing to what fraction of sales). An analysis of variance and exploratory data 
dysis were performed to analyse the forecast errors. The results give a direction of what 
ecasting strategy would be most beneficial (in this case a product-oriented approach, as 
posed to a geography oriented one was picked). A simple, aggregate inventory model 
s built to examine the potential savings from reduced inventory levels. This model 
)wed that safety stock was the largest component of inventory and identified the factors 
intitatively. 

A more recent example of model-based diagnosis is the study for the electrical com- 
aent supply chain as described in § 4.2. A high level quantitative model was developed 
investigate the dynamic behavior of the supply chain (Bhaskaran & Leung 1996). This 
Tormance model was composed of integrated analytical and simulation model compo- 
its. A set of representative products were analysed and the goal was to determine high 
pact opportunities for an overall supply chain performance improvement. Through a se- 
s of what-if analyses, improvement opportunities were identified and ranked according 
their impact. An important conclusion of this diagnosis was that improving the forecast 
or will not result in the kind of improvement management expected, contrary to popular 
iief within the company. Since the model covered all the key components of the supply 
lin, we found it useful as a documentation for the (then) current system, as well as a 
)1 for redesign. This model is described in more detail in § 4.2. 

Redesigning a supply chain involves either one or both of the following: redesigning 
5 business processes that constitute the management and operation of the supply chain 
i redesigning the physical production-distribution network that executes the material 
w in the supply chain. Performing the latter often implies the execution of the former. 


views. 

A key business process for a manufacturer is order fulfillment. This process defines how 
a company fills customer orders, starting from the receipt of a customer order and ending 
with the physical arrival of the ordered products at the customer specified destination. De¬ 
pending on whether it is an assemble-to-order or assemble-in-advance situation, the order 
fulfillment process can include the actual final assembly of products. An example of using 
quantitative models to redesign an order fulfillment process is described in Feigin et al 
(1996). In fact, in this study, a simulation model of the supply chain of the IBM PC Com¬ 
pany in Europe was developed to redesign the overall order fulfillment from assemble-in- 
advance to a combination of assemble-to-order and assemble-in-advance. New inventory 
planning policies and ways of allocating constrained components were introduced. The 
simulation model was used to quantify the benefits of these new processes and identify 
conditions under which they will be most beneficial. 

In a similar spirit, simulation models developed with BPMAT, were used in a number 
of reengineering projects in IBM and others [URL 6]. For example, it was used in inves¬ 
tigating opportunities for cycle time and cost reduction of a purchasing process. Various 
staffing and outsourcing alternatives were modelled during the process redesign stage. 

In the automobile industry, BPMAT models were developed in reengineering a wiring- 
harness predesign process and an engineering change implementation process. A major 
goal of both these efforts was to reduce the cycle time involved. 

On the supply side, a key process is production planning and scheduling. Fisher et al 
(1994) describe how a fashion-ski-apparel supplier reengineered its supply chain to cope 
with a changing business environment marked by uncertainty in demand. The company 
shortened its order processing times, changed the way it purchased raw materials, per¬ 
suaded their customers to place orders sooner, and adopted a new forecasting and produc¬ 
tion scheduling approach. In particular, a quantitative model was developed for the purpose 
of creating a production schedule that minimized the risk of making a non-popular product. 

The model could also be used to analyse parametric changes to the physical supply chain, 
such as production capacity. The model was used to guide the continual refinements to 
the supply chain and product redesign process. It illustrates the concept of “concurrent 
reengineering” where diagnosis and redesign are done iteratively. 

In Feigin et al (1996), the authors mention a new production planning process that takes 
into account constrained availability of components. Instead of performing a traditional 
MRP type calculation driven by finished goods requirement forecast, the production plan 
was formulated based on the most desirable allocation of the constrained materials (e.g. 
for maximizing revenue, fair allocation, product priority). An optimization model was i 
developed to support this decision (Dietrich et al 1995). 

In design/redesign of physical production-distribution network, the use of quantitative 
models are better established and a number of commercial model-based tools are available. ^ 

An early practical work in this area is that of Geoffrion & Graves (1974) (see also Geoffrion ; 

1976). In this study, the problem of selecting sites for distribution centres, their sizes and 
customer zones, and the transportation flow patterns is modelled by a mixed integer mathe¬ 
matical program. The basic mathematical programming model was used by Hunt-Wesson 
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ds who were interested in conducting a systematic study of the entire distribution sys- 
to investigate opportunities for improvement and to resolve expansion and relocation 
es. On the basis of the model recommendations, some existing distribution centres were 
cated and new distribution centres were opened, while realizing significant annual cost 
ngs. Recently, Pooley (1994) extended the basic model to include production facilities 
study for Ault Foods in Canada. Decisions related to production, such as what products 
X be produced in which plant and what plant should supply which distribution centre, 
e considered simultaneously with distribution decisions. 

imilar in spirit, but larger in scale, to the Ault work is the global reengineering project 
ied out by Digital Equipment Corporation to restmcture its supply chain on a world- 
e basis (Arntzen et al 1995). Changes in the business environment such as rapid ad- 
:ement of small computer and communication network technology led Digital “to 
vent itself” in order to survive in the new environment. It changed its conventional 
tegy of high vertical integration and focused on several core technologies. For exam- 
it stopped producing power supplies and cables but retained semiconductors, modules, 
systems. At the same time, the overall level of business decreased and the demand 
em had changed from moderate numbers of complex orders for large systems to large 
ibers of PCs and workstations. It was clear that Digital’s supply chain needed to be 
^signed. A large scale quantitative model was developed to aid this process. The model 
sidered component supply (which vendors from where), production (which plants mak- 
what), and distribution (what customers to serve from where). It minimized a weighted 
ibination of total costs and cycle times. A distinctive feature of this model is the consid- 
ion of global aspects of the supply chain, such as duties, taxes, trade and local content 
lirements. Besides redesigning the supply chain for existing products, the model was 
» used to configure supply chains for new products. The model was formulated as a 
ed integer linear program. 

i. model of a similar flavor was developed at General Motors (GM) earlier, originally 
demented in 1974 (Breitman & Lucas 1987). The original intention of the model was 
elp determine the best strategies for placing facilities to support new products in new 
•kets but was quickly expanded to include decisions such as those faced by Digital. 

; model was solved by (then) commercial mixed integer programming packages such 
dPSX or SCICONIC (Sharda 1995), and was embedded in a decision-support system 
1 business-oriented user interface. 

rt a more detailed level, the Delco division of GM performed a redesign of its distribution 
vork by investigating its shipping strategy (Blumenfeld et al 1987). The goal was to 
jce the inventory and transportation costs of Delco parts supplied to GM assembly 
Its. Alternatives included shipping parts from Delco plants directly to GM assembly 
Its, through a Delco warehouse, a combination of the two, or delivering to several GM 
Its with one truck load. A quantitative model was developed not only to identify the 
imal strategy, which can serve as a benchmark for practical but simpler strategies, but 
) provide an objective means of analyzing the trade-off between transportation and 
mtory costs. The optimization model was solved by mathematical decomposition of 
distribution network into individual links (Blumenfeld et al 1985). 
n recent years, there have been a number of commercial software packages devel- 




considered by Geoffrion & Graves (1974), and Pooley (1994) as mentioned before. These 
include PHYDIAS by Bender Management Consultants, Logistics Toolkit by CAPS Lo¬ 
gistics [URL 10], and Locate 3+ by Cleveland Consulting. Most of these packages are 
based on solving a large scale mixed integer or linear mathematical program. A compre¬ 
hensive survey of commercial tools in this area can be found in Leung (1995) and Ballou 
& Masters (1993). Although not designed for business process reengineering per se, they 
are very useful in reengineering physical supply chain networks. In addition, general pur¬ 
pose operations research/optimization toolboxes or packages can and have been used to 
solve distribution network design problems. Gangoli & Jenkins (1988) report using SAS 
[URL 11] to improve the distribution network of Warner-Lambert as part of its strategic 
planning process. 

Sometimes it is not necessary to use advanced optimization models for disti'ibution 
network design. Mercer & Tao (1996) used a relatively simple dynamic model to study 
alternative distribution policies of a food manufacturer supplying Tesco, a major grocery 
retailer in the UK. The model helped in deciding how products would be transshipped 
between the manufacturer’s finished goods warehouses to satisfy Tesco’s needs. 

In the last several years, a notable development related to reengineering supply chains 
has been the concept of product design for supply chain management (Lee 1993). It is also 
known as design for logistics (Mather 1992). Similar to the well-known concept of design 
for manufacture, design for logistics is concerned with product design that allows effective 
delivery of products to customers. In particular, a key approach is delayed product differ¬ 
entiation in which the differentiation features within a product family are manufactured 
into the products as late as possible. For this purpose, an existing product may have to be 
redesigned or the design of a new product may result in higher material/manufacturing 
cost. In addition, semi-finished products carried in intermediate stockpiles may now bear 
a higher per unit value. It is important to analyze whether the resultant savings in supply 
chain costs (such as reduced finished goods inventory) and other benefits (such as increased 
flexibility leading to better customer service) can offset the incremental cost. Quantitative 
models are useful to this end. Lee (1996) presented two inventory performance models 
to support product design for delayed differentiation, and their real application cases (see 
also Lee & Billington 1995). 

One area that we do not see much reported in the technical literature is the use of 
quantitative models in the transition phase of a reengineering project. After a new process 
has been finalized, a roll-out is required to phase-in the new process. A key factor for the 
success of the new process is the training of staff who either manage or operate the process. 
Because the nature of reengineering stipulates that the new process may be totally new to 
the organization, management by experience alone is often not adequate. Here quantitative 
models of the process developed during the design phase can be valuable in familiarizing 
the staff with the process and providing management insight on the process dynamics. 
The latter cannot be gained from static models such as process charts or IDEF diagrams. 
Simulation models with graphical user interfaces are highly preferable. 

Another important aspect of transition is the development or reconfiguration of infor¬ 
mation systems (transaction processing and decision support) to support the new process. 
Quantitative models developed during design are now useful as a building block for de¬ 
cision support systems or as a technology enabler of the new process (see § 4.3). Once 
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model is embedded in a decision support system, it can be used routinely for contin- 
improvement of the process. Furthermore, quantitative models are very valuable in 
ing operational or tactical planning systems. Typically, an operational system such as 
entory/distribution requirements planning has many user specified parameters. While 
parameters together make the system very flexible, it is often not obvious how they 
•uld be set to meet the business objectives. A quantitative model that captures the dynam- 
of the system (e.g. the one mentioned in § 4.2) is a tool to help optimize the parameter 
ues. 

A supply chain performance model 

nagement of supply chains that are engaged in the production and distribution of goods 
uire a number of key management functions such as demand forecasting, inventory 
nning and control, production planning, manufacturing, and distribution. Ideally, these 
nagement functions work in cooperation to transform raw materials to finished goods 
1 deploy them appropriately to meet the needs of the market. The presence of a large 
duct portfolio, diverse markets, and uncertainties associated with the supply and de- 
nd processes are some of the prominent factors that contribute to the complexity of the 
ividual management functions as well as their coordination in the overall management 
i supply chain. Obviously, such complexity tends to make the decision making process 
icult and challenging at the strategic and tactical levels of supply chain management, 
vlanaging this complexity requires the visibility of how local decisions, within each 
nagement function, affect the overall performance and the competitive advantage of the 
iply chain. Supply Chain performance models provide insight on the impact of local 
isions on global perfonnance by exercising an aggregate quantitative dynamic model of 
supply chain with various decision scenarios. We describe such a quantitative supply 
Lin performance model. The development of the model was pait of a larger effort in 
Droving the profitability and customer service of the business thi'ough better strategic 
1 tactical decision making. The model development included: Interviewing decision 
kers responsible for strategic and tactical decisions, gathering information on how the 
dness operates, understanding the management requirements, and then building and 
idating the quantitative model. 

rhe supply chain that was modelled is shown in figure 4. It is a scaled representation of 
real supply chain and consists of the following physical elements; Ten representative 
iducts, three critical components (i.e., raw materials for final assembly) that are required 
the production, a factory comprising two machine groups where the products are 
•duced, a source warehouse that accepts the factory output, and a distribution system 
nprising ten field warehouses located all over the United States to cater to the national 
rket. The supply chain comprising the aforementioned elements is managed collectively 
the following essential management functions; Demand forecasting, inventory planning 
1 control, production planning, manufacturing, and distribution, 
rhe scaling of the actual system, to derive the supply chain shown in figure 4, was 
;essary to define a manageable scope for the quantitative modelling exercise. It was 
le in cooperation with the management personnel of the supply chain to ensure that the 
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WAREHOUSES 

Figure 4. Schematic of supply chain infonTiation and material flows. 


A detailed study of the overall operations was also conducted to understand the individual 
management functions and how they collectively worked within the supply chain in the 
real world. 

Operationally, the supply chain can be classified as a make-to-stock business that is 
forecast driven. Specifically, no customer requests for products are directly entertained at 
the factory. As a consequence the supply chain can be viewed as a large scale push-pull 
system. The pull and push parts are referred to as the commercial and the manufacturing 
processes respectively. The commercial process is engaged in “selling” products to cus¬ 
tomers from the inventory that it previously “bought” from manufacturing. The quantity 
it “buys” at any point in time is governed by the forecast for product demand as well as its 
policies with respect to hedging against demand and stock replenishment uncertainties. On 
the other hand, the manufacturing process strives to make-to-stock based on the anticipated 
needs of the commercial organization. The push and the pull operations are phased apart 
by the manufacturing lead time, which is the time in advance that manufacturing plans to 
meet the future needs of the commercial process. 

The development of the performance model involved the following steps. 

(1) Defining the supply chain: This includes the structural constraints of the supply chain 
model, i.e., what the elements of the supply chain (physical elements such as the source 
warehouse and the management functions such as production planning) are, and how 
they operate as part of the supply chain. 
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model and constitute an independent set of variables (that are different from other 
internal variables of the model). They are chosen based on the study of the real sup¬ 
ply chain and in consultation with management personnel who are the end-users of 
the performance model. Some of the management variables include the forecast un¬ 
certainty for the demand forecasting function, planning frequency and time fence for 
the production planning function, lead time and delivery reliability for the component 
supply, run quantities and standard efficiencies for the factory, transit lead times for the 
distribution, and order quantities and service levels for the inventory planning function. 

Specifying the model output: This involves the specification of the performance mea¬ 
sures that will be used to measure the performance of the supply chain, upon execution 
of the model. The supply chain performance is measured using pre-defined perfor¬ 
mance measures that are categorized as inventory-based and time-based. Inventory- 
based measures include on-hand inventoiy throughout the supply chain, inventory 
turn, and fill rates. Time-based measures include the customer order lead time, the 
manufacturing lead time and the overall system response time. 

Characterizing the elements of the supply chain: Mathematical models of the manage¬ 
ment functions in the supply chain (shown in figure 4) such as demand forecasting, 
production planning, and inventory planning, and the physical elements such as the 
factory, the source warehouse, and the field warehouses ai’e developed. These models 
ai-e sufficiently detailed to capture the impact of the management variables (item 2) on 
the performance measures (item 3). 

Developing the quantitative model: The individual mathematical representations of 
the elements in the supply chain are unified in a mathematical framework taking into 
account the various constraints. Such a quantitative characterization of the supply 
chain is refemed to as the performance model. 

It may be noted that the two flows that essentially integrate the supply chain are the 
Lterial and information flows. The dynamics of these flows are shaped by the external 
:tors such as the customer demand as well as the local decisions made by each manage- 
:nt function. The two basic attributes of the dynamics are quantity and time. Recall that 
; performance measures for the model are also aptly categorized as inventory-based (i.e. 
antity-based) and time-based. Any integrating mechanism must therefore feature these 
0 basic attributes. Additionally such a mechanism must be consistent with the models 
the individual elements of the supply chain. The two attributes, namely, the quantity 
i the time, are characterized stochastically. 

The model was validated with real data and presented to decision makers in the form 
a decision support system with interactive user interface. The system allowed decision 
Lkers to run alternative supply chain scenarios by changing the management variables, 
ese scenarios could then be compared in terms of the performance measures. The system 
[S used in the diagnosis of the overall supply chain as well as in streamlining the existing 
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4.3 Reengineering the supply chain 


As mentioned in § 2, the supply chain depicted in § 4.2 was managed by function (sales, 
customer service, forecasting, inventory planning, production planning etc.) and further 
by market channel (retail, wholesale, industrial, export etc.) on the sales and service side. 
In such an environment it was not surprising to find the following key observation in the 
diagnosis phase of an effort to reengineer the order fulfillment process; The process was 
fragmented with too many hand-offs of both information and physical product, and did 
not present a consistent, unified interface to the customer. A large part of the redesign was 
therefore devoted to address this issue. 

One candidate design for a new order fulfillment process involved the integration of most 
of the customer interface and front end planning activities into a new sub-process, called 
the logistics account management process (LAMP). Specifically, LAMP included taking 
and delivering customer orders, and was responsible for ensuring that there was adequate 
stock in the distribution network to fulfill orders. This meant that sales forecasting and 
finished goods inventory planning would be performed in the same process that dealt with 
customer orders. Knowledge on customer demands would reside within this process. A 
new class of jobs, called logistics account managers, had to be created to man the LAMP. 
The job of logistics account managers was composed of tasks then carried out by several 
functional departments, and a few new activities which were not performed before. New 
skills had to be acquired by either training of existing employees or through new hires. A 
business issue was then how many of such logistics account managers were needed and 
what customers should be allocated to each team in terms of workload. Would the new 
process be too expensive (in tenns of trade-off between the new supply chain capability 
and any additional cost)? 

In the initial design stage, the principal process flows in the LAMP were captured 
in a queuing network model using a commercial software package, MPX by Network 
Dynamics. The objective of the model was to: 

(1) establish process feasibility, in terms of staffing levels and customer order response 
times; 



9 



(2) aid the design of customer portfolios, from the view point of work content and load I 

balance of candidate portfolios; [ 

(3) support the development of detailed process design. 


A representative customer region was selected for this initial design study. For input to 
the model, relevant historical demand data were obtained from corporate databases and 
work flows of existing tasks and estimates of their times were gathered from record files 
and interviews with appropriate personnel. In establishing process feasibility, quantitative i 
results from the model were provided to the reengineering team and company management 
to help decide on whether to continue with this design. Some examples are: ^ 

(1) minimum total number of logistics account managers required for the customer re- . 
sponse desired; 
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existing marketing organizational structure (i.e., each channel would have its own 
team of logistics account managers); 

number of logistics account managers required if some tasks were to be assigned to a 
few specialists (this is a mixture of the current process and the new design). 

br strategic reasons the existing marketing structure (by channel) would be maintained, 
h each channel responsible for its own profit and loss. However, to retain the manufac- 
ng economy of scale (important for a low margin business we were in), the product 
ply would be shared by the different marketing channels. To maximize risk pooling in 
new design, finished goods inventory would not be “tagged” for a particular channel 
il it was used to fill a customer order, even though the channel planned for it through the 
MP and was independently responsible for their finances and customer service mea- 
ements (option 2 immediately above). This created a problem of how to allocate the 
entory costs to the different channels. To this end we developed a quantitative inventory 
asurement model, whose role in the proposed inventory management architecture is 
wn in figure 5. 

Trst, the demands for all products were analysed and allocated as a “shared” product 
a “private” product to a channel. The criterion for allocation was a specified fraction 
sales handled by a channel and a threshold total sales volume. All inventory costs of 
/ate products would be carried by the individual channels that owned the products, as 
se costs were (relatively) too small to be further broken down. For the shared products, 
inventory measurement model was applied. The model was based on decomposing the 
il inventory into logical stockpiles according to the reasons they were there, i.e., the 
ck drivers. Figure 6 shows the logical stockpiles and their measurement principle. At 
end of each accounting period, the measurement model would be executed to allocate 
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Stock driver 

Inventory contribution 

Measurement principle 

Production Inflexibility 

Supply Stocks 

Attributed to Supply Process 

Distribution Pipeline 

Cycle Stocks 

Attributed to channels based 
on Sales Volume 

Supply Lead time Variance 

Safety Stocks 

Attributed to Supply Process 

Demand/Forecast Variance 
at Product Level 

Safety Stocks 

Attributed to channels based 
on individual demand/forecast 
variance 

Service Levels 

Safety Stocks 

Attributed to channels based 
on target service levels 


Figure 6. Inventory categorization and measurement basis. 


inventory costs to the different channels. An important feature of the model was that it was 
completely transparent to the goods flow and did not impose any burden on the day-to-day 
operation of the supply chain. The model could be completely automated with required . 
inputs that were already being measured from the physical system. The model served as ^ 
a technology enabler of the new process design and, more importantly, provided valuable 
performance feedback to the management of the different channels. 

f 

t 

4.4 A production supply chain I 


A single production plant can also be a supply chain with daunting complexities. We 
consider a North American display product manufacturer with annual revenues of $600 M. 

The production process is a hybrid of consumer electronics production (in terms of flexible 
flow through parallel banks of workstations that operate in tandem with issues of major and 
minor changeovers) and semiconductor production (in terms of batch processing stages, 
reentrant flows, and associated yield uncertainties). Complicating this scenario was an ^ 
uncertain demand, vagaries in the supply of components, and a multi-level bill of assembly 
shown in figure 7 with each level having its own detailed and complex process route through | 
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itwork of machine groups. The facility produced three basic families of products, 
varying product mix also introduced contentions between these product families for 
:hine capacities. 

he facility was plagued with poor delivery performance and having difficulty coping 
1 the process complexity, machine downtimes, varying demand and product mix, and 
ertain component supply. In short, the production planning function had an inadequate 
erstanding of the capacity of the plant and was unable to plan effectively. The plant 
lagement was interested in reengineering the facility and specifically in evaluating 
?pective factory layouts. This included outsourcing portions of the overall process and 
amlining the process flow in terms of efficient load balancing and process batching, 
ddition to these stmctural and manufacturing execution changes, the management was 
I eager to upgrade the production planning capabilities of the plant in terms of decision 
port tools. 

ypically, capacity calculations that accompany production planning are done using 
jadsheets which provide a static snapshot of individual machine capacities. The un- 
ainty associated with the''number of starts, the yield, breakdown of machines, and the 
iplicated process flow are difficult to capture in a static rough-cut capacity calculation, 
he other extreme, detailed simulation models can capture the process dynamics but are 
;n too cumbersome to build and too computationally intensive to be of any use as an 
ractive decision support tool. An efficient and expedient way to model capacity at an 
regate level, that takes into consideration the shopfloor uncertainties and the process 
amics, is using queuing networks (see figure 8). 

Ve used the MPX software to model the process flow based on queuing networks. The 
val and departure process to each workstation, the process time at the workstation, the 
Auction yield, and the workstation reliability are stochastically represented. We then 
/e- a number of aggregate dynamic equations that collectively characterize the queu- 
network. The theoretical basis for the aggregate dynamic models and solution of the 
uing network can be seen in Suri et al (1993) and Whitt (1983). Optimization models 
•e also separately built to balance the load across banks of parallel heterogeneous work- 
ions with special flow constraints. A decision support platform was built unifying the 
'X and optimization models and providing an interactive user interface for the analyst, 
^he input for the aggregate capacity model was obtained from various sources. Process 
V data were obtained from process layout drawings and based on tours and actual ob- 
/ations of the production process. Process times were estimated based on information 
vided by production personnel. Process yields were similarly estimated. Machine reli- 
lity and changeovers were partly estimated and partly based on historical information. 
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Data on material handling devices such as conveyors and monorails that transported goods 
between workstations were obtained from technical documentation on operating proce¬ 
dures. Historical demand data were also obtained to characterize the product mix and the 
arrival process to the queuing network. 

The MPX model itself served as a practically useful and comprehensive data dictionary 
for capacity planning and resource allocation. The model served to integrate the fragmen¬ 
tary data that were stored in various media and validate it in the context of capacity and 
production planning. Further, the data were now in a form that allowed the factory manage¬ 
ment to easily evaluate prospective factory layouts as part of its reengineering activities. 
The specific outputs of the capacity model include long-run average performance for indi¬ 
vidual products in terms of production achieved, yield, flow time, and work-in-progress. 
The output also includes utilization for machines as well performance for the overall plant. 
The production planning personnel envisaged the use of this model for forward (given the 
demand mix how long it takes to produce) and backward (given the horizon what is the fea¬ 
sible production mix) production planning. Additionally, the model can be used in decision 
support mode for bottleneck, yield, lead time, and work-in-progress inventory analysis of 
either the overall plant or any segment of the plant. 


5. Concluding remarks 

Quantitative models are very relevant in the diagnosis, redesign, and transition stages 
of reengineering. They bring concurrency to the diagnosis and redesign stages. Further, 
the quantitative models provide an objective basis to debate the assumptions and derive 
alternative designs. Even after the best design is picked the models can be used as part of 
on-going decision support and planning tools for supply chain management. 

Supply chains are of growing interest among businesses as is evident from the rapid 
growth of supply chain related IT businesses. Following the first-wave of reengineering, 
organizations have invested heavily in Enterprise Resource Planning (ERP) software such 
as SAP [URL 12], Baan [URL 13], and BPCS [URL 14] to integrate enterprise-wide 
transaction data that is contained in heterogeneous databases. At the same time supply 
chain operational planning software has emerged to offer point solutions such as inventory 
planning, forecasting, deployment planning and transportation management. In the second- 
wave of reengineering, organizations ai'e eager to have the various operational planning 1 
software inter-operable by integrating them in a flexible manner. Currently no supply 
chain architecture or supply chain reference model exists that will serve to guide the 
integration process. At the same time the tactical and strategic planning levels of supply 
chain management are inadequately served. Quantitative modelling, such as that founded 
in OR/IE, can serve a useful purpose to fill this void. 

However, traditional OR/IE modelling with sole emphasis on models and data structures 
will not be adequate (Slats et al 1995). The models will have to be viewed in the larger 
context of distributed decision support solutions for supply chain management. The models 
must therefore be designed to support multiple users to support cross-functional decision 
making. Additionally, the models must be flexibly designed to be reusable and applied 



ion to data communication and processing requirements imposed by the model and the 
ition strategy. 

Quantitative modelling, knowledge of business processes that define real supply 
ins, and information technology, collectively constitute a powerful and compelling 
ision technology base for reengineering, integrating, planning, and optimizing supply 
ins. 
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Abstract. Manufacturing and design are very closely related. The manu¬ 
facturing capabilities available impact the scope of design, while design for 
manufacturing ensures the economic success of the products. 

Furthermore, the major goals of firms in the nineties are to significantly 
reduce product costs and time to market (TTM). To meet these goals, quality 
product designs that meet customers’ needs have to be developed. 

Design methods such as Quality Function Deployment and Pugh’s concept 
selection technique have been used to significantly improve engineering design 
processes. Developed as separate tools, however, they are difficult to integrate 
and coordinate in the total design process, since the relationship of the two 
methods is often unclear. 

This paper demonstrates, through a case study, that these methods are in 
fact results of a similar underlying concept. Design problems and solutions are 
unified by the concept of engineering models. An engineering model is a set 
of equations that relates the design variables to the performance metrics used 
to quantify performance of a product. Together with the engineering models. 
Quality Function Deployment and Pugh’s concept selection technique have 
been used in the design and development of a hematology machine from concept 
to prototype. 

Keywords. Integrated product development; quality function deployment; 
concept selection technique; time to market; product costs; hematology 
machine. 


Introduction 

relationship between manufacturing and design has been well recognized. A study 
e by Westinghouse (1984) shows that 80% of all life-cycle costs are fixed during the 
[gn phase. Good manufacturing techniques cannot compensate for poor design, 
urther, in this highly competitive age, for a company the ability to identify products 
the customers need is crucial. The degree to which a product satisfies customer desires 

rWoMllPt On/^r^ACC /T-Iico 1 QQA^ A ^/^ncor»c«nc« to T*ot-\t/^lTr rr 




in industrial practice that customer desires can only be obtained through actual contact 
with the customer and that designers are often wrong when they try to guess what the cus¬ 
tomer wants (Rabino & Muskowitz 1984). To facilitate customer focus, several structured 
methodologies and representations for organizing and presenting customer information 
have been developed. One such representation is the House of Quality (HOQ), which 
helps product designers to explicitly identify customer requirements, relate them to ob¬ 
jective engineering characteristics, identify tradeoffs, and to evaluate the characteristics of 
a potential product relative to competing products (Hauser & Clausing 1988). The HOQ 
provides a product development team with a compact description of three important items 

(1) Customer needs and their relation to objective engineering characteristics. 

(2) Comparisons with competing products based on objective engineering numbers. 

(3) A summary of the engineering tradeoffs inherent in the design. 

In a typical situation, marketing staff collect the data about customers and competing 
products and, possibly with some input from engineering, create the HOQ. They decide a 
set of performance targets which are then communicated to the designers. There are two 
specific sources of problems in this process, 

(1) Targets set based on the information contained in the HOQ alone are often unrealistic. 
As a result, designers cannot achieve them and this results in time- consuming iterations 
until an achievable specification is reached. 

(2) The way coupling between design variables is described in the HOQ does not ade¬ 
quately reflect the trade-off that must be made in real design problems. 

After understanding the voice of the customers, concepts are generated using brain¬ 
storming and these concepts are selected using the method proposed by Pugh (1991). This 
will be described in a later section using a case study. 

In addition to House of Quality and Pugh’s Selection Method, designers often have 
reliable engineering models that they can use to test the limits of product performance. An 
engineering model is a set of equations that relates the design variables to the performance 
metrics used to quantify performance of a product. 

The synthesis of the methods mentioned above is shown in figure 1. 


2. Quality function deployment. The house of quality 

The identification of customer needs remains a key aspect of developing robust and success¬ 
ful products. Without customer needs to guide new product design decisions, the products 
themselves have relatively low probabilities of success in the market. The three key steps 
that allow a researcher to construct the voice of the customer are as below (figure 2). 

(1) Identify unstructured list of customer needs. 

(2) Set development priorities by quantifying the relative importance of the different needs. 

(3) Translate the quantified needs into specific engineering goals. 


Integrated product development 


191 



Figure 1. Integrate product 
design process. 


further, it remains critical that the exact words and vernacular be used throughout the 
ntification, structuring and prioritization process. Without using the exact words of the 
tomer, it may become difficult to identify the nuances of the customer preferences; if the 
tomer voice is constantly processed and re-translated, the final translation may become 
rkedly different from the customer’s initial intentions. Because of this need to capture 
exact words of the customer, one-on-one interviews and focus groups provide a superior 
dium other than sales force or distributor reports for producing a rich set of needs, 
mrther, the interactive one-on-one interviews may provide a more robust set of needs 
n focus groups because of the opportunity to probe deeper for customer needs, rather 
n focusing on the obvious issues which a focus group can easily identify. 

Mauser and Griffin (Griffin 1989) have also carried out research on identifying how many 
itomers need to be interviewed in order to identify a percentage of the unmet needs. They 
^e created a Beta-binomial model which predicts the percentage of the needs identified 
en the number of customers interviewed. Based on the model predictions, interviews 
:h sixteen customers identify 80% of the needs, while thirty interviews would be required 
dentify 90% of the needs. Considering the high cost of market research, significant cost 
ings can be realized bv reducing the number of surveved or interviewed customers. 
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Process Steps 


Process Tools 



• Experiential Interviews 
with Med Techs 

• Patent Searches 


• Experiential Interviews 
with Med Techs 

• Focus Group with 
Med Techs 

(large physician practices) 


• Card-sort Interviews 
with Med Techs 


• House of Quality 
with Research Team 


Figure 2. Market research process - 
identifying and applying the voice of the 
customer. 


Hauser and Griffin’s model was used to determine the number of customers that we 
should interview. Through experiential interviews, card-sort interviews, and the focus 
group, the research team has directly contacted 33 medical technicians who were involved 
in hematology testing. Based on Hauser and Griffin’s model, we should have identified over 
90% of customer needs. Some of the results are shown in tables 1-3. Due to proprietary 
considerations, not too many details are given on this project in this paper. However, the 
nature of the'project is to design a next generation of hematology equipment for a major 
corporation in the United States of America. 


Table 1. Ten most important needs for physician-practice and medical technicians. 
Importance Customer need 

1 Achieves level of accuracy equivalent.to competitor’s machine 

2 Accurate even if the blood clots 

3 No opening of sample tubes 

4 Machine that can be stopped for RUSH job or emergency job (‘STAT’). 

5 Low sample volumes for automatic testing 

6 Analysis time less than a minute 

7 Eliminate opening tubes to solve aerosol problem 

C __ _ *__ _ t-l _11.. 
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le 2. Examples of translations from customer needs to engineering design requirements, 
tomer needs Engineering design requirements 


Time to remove cover 

Number of steps to take apart for repair 

Height, width and depth of the machine 


:hine covers that are easy to remove • 

ainate error in patient identification • 


re quality control information • 

ninate weekly or long-term maintenance • 

Hour service available • 


ilyse blood in less than a minute 


Number of human failure points 
Amount of information on tube 
Level of redundancy 
Labeling time 

Patient identification system cost 

Read and process time for patient identification 

Data storage capacity 

Number of computing resources and functions used 
Number of external access routes 


Repair time 

24-Hour maintenance contract availability 
Mean time between failure 
Lead time for repair 

Centrifuge spin time 
Scanning time 
Analysis time 
Material handling time 

Read and process time for patient identification 
(result) Print out time 


Weekly cleaning time 

Mean time for preventive maintenance 

Downtime for preventive maintenance 


Fhis QFD process is most effective when collective experiences of a multifunctional 
m are greater than the sum of the individual experiences. Therefore, the decisions 
:de jointly by the team will consider all the important issues of customer needs, robust 
hnology, manufacturability, and external design. 

rhe relationship matrix in the House of Quality (Hauser & Clausing 1988) helps the 
m members to clarify the relationships between requirements and output and serves as a 
erence document as the design process evolves. The roof of the HOQ provides informa- 
n on the relationships among the engineering characteristics. Additional columns and 
vs on the HOQ provide summary information on the technical difficulty of improving 
^ineering characteristics, and the relative position of competitors. 


Pugh’s concept selection technique 

e Pugh’s concept selection technique provides a systematic process for choosing the 
^st” design out of several, based on attributes that are previously determined. The Pugh 
)cess guards against potential weaknesses that might be due to lack of thoroughness 
conceptual selection. The process forces the team to consider all types of possible 
icepts, explain the potential solutions in sketch form, and place the solutions along 

; horizontal axis of thp splpotinn matrix- thp. tpam then iHpntifips rpipvant rritp.ria for 
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Table 3. Most important design requirements in physician practices and hospitals (10 for most impoi 
and 1 for least important). 


Engineering design requirements 

Physicians 

Hosp 

Increase variability of batch sizes 

4 

4 

Reduce number of steps before walkaway 

6 

6 

Reduce number of steps on return 

4 

4 

Reduce required time between loading 

4 

4 

Include password for supervisor 

5 

i 

Include alternate technology 

5 

t 

Reduce time for maintenance 

5 

< 

Increase automatic maintenance level 

4 

4 

Reduce number of human failure points 

6 

1 

Reduce material handling time 

4 

4 

Reduce downtime for preventive maintenance 

4 


Reduce number of blood exposures 

5 

4 

Reduce number of aerosols 

5 

t 

Reduce number of times disposables handled 

4 

L 

Reduce number of sharps handled 

4 

L 

Increase data storage capacity 

5 

t 

Reduce number of computing resources and functions used 

5 

t 


the concept selection which could come from either customer input or expectations ol 
design team. 

A datum concept is then selected with which all other concepts will be compared ag£ 
(often, the datum concept is the currently existing design). After establishing the dal 
for each individual criteria the other concepts are rated as better (+), worse (—) oi 
same (S) as the datum. After completing the matrix, it is possible to graphically > 
which concepts best satisfy a wide range of criteria. Based on the matrix, it is posj 
that a dominant concept may emerge; it is also possible that some concepts will ha 
number of relative strengths and weaknesses. Hybridization of the concepts can be pli 
alongside the initial concepts and “re-pughed” with the same criteria. Thus, this pro 
provides a systematic method of acquiring: 

(1) greater insight into specification requirements; 

(2) greater understanding of the problem; 

(3) greater understanding of potential solutions; 

(4) knowledge of why one concept may be stronger than another; 

(5) team consensus on all of the above; 

(6) a natural stimulus to produce additional concepts. 

Moreover, in addition to improving communication across different members in a pn 
team, QFD provides a format for documenting decisions that can be used to guide fi 
decisions. All of the documents produced by the QFD process can be used as documentj 
for design decisions and product modifications. As a design project progresses, the 
often confusion, and second-guessing as to why a previous decision was made. The I 
and Pugh selection matrices can immediately provide information on the rationale fo 
previous decisions. Figure 3 shows the results of one of the Pugh matrices for pa 
identification. 
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Patient Identification 

Concept Selection 

Criteria 

“Shipping 

Tag” 

“Bracelet 

& 

Tube” 

“Form 

& Tube 

Sticker” 

“Fortti 

& 

Tube” 

“Existing" 

“Auto” 

“Coded 

Tube & 

Form" 

System Cost (-) 

+ 

s 


+ 

+ 

- 

+ 

Manual human 
interaction with tube 
and/or form (-) 

- 

+ 


s 

- 

+ 

s 

Number of potential 
failure points from 
luman interaction (-) 

s 

+ 

D 

s 


+ 

s 

Cost per sample - 
disposable cost (-) 

+ 

+ 

A 

s 

+ 

+ 

s 

Machine can read 
patient identification 
(+) 

- 

s 

T 

- 

- 

s 

s 

iase of human reading 
on tube (+) 

+ 

s 

U 

s 

- 

+ 

s 

Reliability in reading 
patient identification 
(+) 

s 

s 

m 

s 

- 

+ 

s 

Potential patient 
interference with 
blood band and 
measurement 
operation (-) 


+ 


+ 

+ 

+ 

s 

Laboratory 
information system 
(LIS) dependency 
(-) 

+ 

s 


+ 

+ 

+ 

+ 

SUM 

0 

+4 

0 

+2 

-2 

+6 

+3 


Figure 3. Patient identification Pugh chart. 


Engineering models 

ay firms designing products, ranging from bearings to automobiles, have developed 
ineering models for their products. An engineering model is a set of equations that 
ites the design variables to the performance metrics used to quantify the performance 
L product. 

"hese models are used to decide whether designs are feasible, to explore the perfor- 
ice envelope of a design without actually building a physical prototype, and to study 
tradeoffs involved in the design. A reliable engineering model is often a key part of 
competitive advantage in product design. This is because it allows the designer to 
eriment with large numbers of possibilities without increasing the design time. Hav- 
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Motor Type i 


Battety Type ( 



Duty 

Cycls 


DESIGN _^ INTERMEDIATE Causality _PERFORMANCE 

VARIABLES ^VARIABLES ‘^METRICS 

Figure 4. Structure of engineering model for precision centrifuge. 

completed rapidly and reliably on the computer without the delays and effort required to 
build and test prototype hardware. 

Figure 4 shows the structure of the engineering model for the precision centrifuge that 
is required in the design of the hematology equipment. 

The inputs to the model are: 

(1) Motor torque constant (Kt), internal resistance (Rm) and mass (Mmot), rated load 
(Pr). All these get fixed when a particular motor type is chosen; 

(2) Battery resistance/cell (i?ceii)> battery energy/cell (jBceii) and mass/cell (Mceii). All 
get fixed when a particular battery type is chosen; 

(3) Number of cells in the battery (Ac); 

(4) Transmission ratio (Aj); 

(5) Mean torque (tm) at which the tool will be operating. 

The outputs of the model are: 

(1) Weight of the plate (M); 

(2) Speed at rated torque (cup); 
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engineering model for the precision centrifuge is therefore; 

t = xm/Nt, 

1/ = A^c X 1.2, 

E = Nc X Ecqw, 

R^=Nc X RceW, 
lA=r/KT, 

COR=NTiV -lA(RM-hRB))/KT, 

"'^max — VKt/(Rm + Rb). 

L^y = E/(iIRb + rm^), 

M = Mmot + A^C^cell + MtNj, 

D = Pr/(Wrt), 

ire 

t: motor torque, 

V: battery voltage, 

E : battery total energy, 

I A' steady state cun*ent, 

P : motor power output, 

Mt : transmissive mass factor. 

ome experimentation with the model makes it clear that many trade-offs are possible 
le design. For example: 

nergy can be added by using more cells in the battery, but this makes the centrifuge 
eavier; 

more efficient motor can be chosen but efficient motors tend to run at higher speeds 
nd so a greater transmission ratio is needed to get the required torque. 

mgineering models are useful in conjunction with the House of Quality because they 
iplete the chain from the entities the designers can actually change, the design variables, 
/hat the designer wants to affect, namely, the customer perception of the product. 

Final system configuration 

^ final system design has two centrifuges, each of which can be loaded with a carousel 
ny time. When the centrifuges are expeditiously loaded with carousels batched with 20 
?s, the machine can have a maximum achievable throughput (after the first 5 minutes 
:entrifuging) of 400 assays per hour. 

^ black and white LCD screen displays patients’ blood analyses, quality control and 
bration information. The system has the following features: 

/alkaway capability; 

liffraction grating optical reading station; 
utomatic patient identification; 

.,aboratorv Information Svstp.m intorfaro r.anabilitv 
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6. Conclusion 

The product development team’s group dynamics is critical to the success of a product 
(Finger & Dixon 1989). It is important that the team members interact and communicate 
effectively and understand each other’s discipline. The integrated product development 
incorporating tools from Quality Function Deployment, Pugh Concept Selection Technique 
and Engineering Model have been found to be effective in this project which involves 
people from diverse backgrounds. 


The author would like to express his sincere gratitude for the opportunities provided in 
the New Products Program in the Massachusetts Institute of Technology. In particular, he 
would like to thank Professor Flowers for his guidance in two projects carried out in the 
program. 
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Abstract. The next generation manufacturing technologies will draw on new 
developments in geometric modelling. Based on a comprehensive analysis of 
the desiderata of next generation geometric modellers, we present a critical 
review of the major modelling paradigms, namely, CSG, B-Rep, non-manifold, 
and voxel models. We present arguments to support the view that voxel-based 
modellers have attributes that make it the representation scheme of choice in 
meeting the emerging requirements of geometric modelling. 

Keywords. Voxel modelling; virtual prototyping; layered manufacturing; 
rapid prototyping; reverse engineering. 

Introduction and motivation 

confluence of three major technological revolutions is providing the impetus to the 

an turn changes in manufacturing technologies as we move into the next millennium. 

I Information technology: Increased power of processors, decrease in the cost of mem¬ 
ory, evolution and acceptance of the Internet and its rapid growth, the World Wide 
Web, multimedia and virtual reality technologies, distributed and parallel computing, 
growth of terrestrial, satellite, and broadcast communications, are some of the com¬ 
ponents of information technology that have a direct impact on manufacturing. For 
instance, the information superhighway with teleconferencing facilities will lead to 
large scale telecommuting of information workers. Computer-supported collaborative 
work will enable a geographically separated workforce to work as a team. 

) Material technology: Several new materials called smart materials, materials with 
memory, composites with highly specialised functionality, low temperature super¬ 
conducting materials, materials for fibre-optic communication, high efficiency solar 
energy converters, materials for solid-state batteries etc. have evolved. 

) Process technology: Advances in VLSI technology have enabled the production of 
multi-million transistor chips and 256 Mbit memories. The experience with VLSI tech¬ 
nology has inspired two diametricallv opposite process technologies, namely micro 
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(also refeixed to as rapid prototyping technologies, RPT). Both have their genesis in 
VLSI fabrication technologies and both require highly interdisciplinary research and 
development. Similar to VLSI technology, these two technologies depend critically 
on the development of new and powerful CAD tools. 

(i) MEMS: In this technology, the sensors, the processors and the effectors can all 
be fabricated in the same process. An example is a commercially available radar, 
costing $5 that can be fitted into automobiles to detect potential collisions with 
other automobiles. MEM systems promise to be the ultimate in mass production 
of electro-mechanical components, finished assemblies in millions of units at per 
unit costs of a few dollars. 

(ii) RPT; A range of new pmcesses variously called layered manufacturing, additive 
manufacturing and three-dimensional printing have emerged during the past few 
years (Burns 1993). These processes fabricate a component directly from the CAD 
description without the need for any additional fixtures, dies or tools. As opposed 
to the traditional approach of removing unwanted material from a block to arrive 
at the desired shape, these new processes build an object by selectively adding 
material where required by the geometric model. This technology promises to be 
the ultimate in customization, enabling the economical fabrication of one-of-a- 
kind finished assembly of complicated subunits. 

In addition to the three technological streams the changing economic scenario is forcing 
a relook at manufacturing strategies. The rapid globalisation of the economy and the 
increasing competitiveness in the manufacturing sector has led to new paradigms like 
concurrent engineering, just-in-time manufacturing and business process reengineering. 
Product cycle times have come down from a few years to a few months and less. From 
concept to the customer, time required has been constantly shrinking and now we have 
reached a stage where custom products and low-volume high-custom products have begun 
to dictate competitiveness. Agile manufacturing is the term coined to reflect this ability to 
meet ever-changing product specifications. In areas like wearable computing (Finger etal 
1996), where the associated technologies are constantly changing, completely new design 
philosophies have become essential. 

The combined impact of these changes can be expected to drastically change manufac¬ 
turing. Some of the extrapolated possibilities are conjectured below. 

• Customer-in-the-loop manufacturing: For example, custom kitchens are designed by 
the customer using VR interfaces, built and delivered later at the customer’s residence 
(Nomura et al 1992). 

• Coin-operated manufacturing/shopping-mall manufacturing: The previous scenario 
refers to consumer durables. A similar quest for customization of consumer goods like 
shoes and personal accessories may result in compact manufacturing units being located 
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le used. Examples are: kitchenware (spoons, forks, plates etc.) and toys. The customer 
ses highly interactive tools to design the objects and fabricates them in-house. The 
quipment has to be capable of making a range of shapes using a range of input raw 
aaterial (fresh or recycled). 

ipace-based manufacturing ; As humans expand into space, by first building space 
tations and later moving to the planets, newer manufacturing technologies will be 
leeded: flexible systems that occupy minimal volume and have minimum weight and 
re capable of fabricating a range of complex shapes will be required. 

)istributed product life-cycle management: Products will be conceived, designed, fabri- 
;ated, marketed, maintained and upgraded by collaborative teams distributed geograph- 
cally, 

iVhen we look at this exciting spectrum of possibilities, as well as the as-yet un¬ 
tight of new processes and materials, it becomes very important to examine the 
Is we need to make them happen. At the fundamental level, any material object 
jchanical, electromechanical) to be fabricated needs a model that forms the canvas 
which the design process generates diverse paintings. Any such object is captured 
a volume model. We use the term volume model since it is more general than the 
n solid model: a volume model can represent non-homogeneous objects (compo- 
js), flexible objects, and objects that are made of .solid, liquid and amorphous 
terial. 

[n this paper we examine the fundamentals of volume modelling for emerging manu- 
turing technologies. We first outline a set of requirements that a volume modelling 
•adigm should satisfy. This is followed in § 2 by a brief outline of existing modelling 
•adigms and their comparison based on our desiderata. In the final three sections we look 
;um at the requirements of three major aspects of manufacturing, namely, virtual proto- 
)ing, physical prototyping and reverse engineering, and indicate how voxel modelling 
miquely suited to meet these requirements. 

The major thrust of this paper is that voxel-based modelling has substantial unexploited 
vantages over other modelling paradigms. 


Desiderata for a volume model 

i present below a set of attributes that can provide a comprehensive characterization of 
epresentation scheme for volume modelling. We have grouped these attributes to reflect 
i three phases in the lifecycle of mechanical parts: 

Model creation and maintenance 

- renderability 

- morphological dexterity 

- heterogeneity 

- editability 

- brevitv 
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• Model analysis 

- analysability 

9 Model fabrication 

- reconstructibility 

- physical realizability 

- accuracy 

1.2 Rendembility 

This captures the ease with which the volume model can be rendered. Parameters to be 
considered are the complexity of the rendering computation and the quality of the rendered 
images as well as the ability to render the model at various levels of quality. For interactive 
modelling, the ability to rapidly preview the model, interact dynamically with the model 
as well as get high quality presentation images, are all essential. 

1.3 Morphological dexterity 

The ability to represent a variety of shapes is an important aspect of any volume modeller. 
Traditionally solid modelling has focused on representing shapes that are restricted to 
manifold topologies. However, the importance of non-manifold representations has been 
recognized (XOX Corporation 1995). 

The morphological dexterity attribute is aimed at capturing the complexity of shapes that 
can be represented and the ease with which complex shape modifications can be achieved 
on the given volume representation. 

1.4 Heterogeneity 

Traditional CAD tools available today implicitly assume that the part being designed is to be 
fabricated using a single homogeneous material. A few niche areas like aircraft/spacecraft 
use specialised CAD tools to model composites. The need for general CAD tools to han¬ 
dle composites will be realized as an increasing range of products are fabricated with 
composites. 

The heterogeneity attribute will evaluate the ability of a representation to capture inho¬ 
mogeneities of the following types: use of different materials, use of different densities of 
the same material across the cross-section of the solid, use of hollow structures (example: 
honeycomb) and integrated electromechanical components. 

1.5 Accuracy 

When evaluating a volume representation scheme with reference to the accuracy of the 
representation the following factors have to be considered: 

The accuracy with which the model captures the intended shape of the part is of primary 
importance. Of equal importance is the ease and accuracy of specification of tolerances 
consistent with the ideal design (Requicha 1984). 
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Editability 

i central purpose of a computer-based modelling paradigm is to empower the user to 
set changes to the model. We intend the broad term “editability” to encompass both 
al changes as well as global changes in the model. Examples of global changes are 
scaling or shearing of the entire part in a single operation. A less obvious, but equally 
Dortant aspect of editability is the system’s ability to allow the user to make such changes 
»th global and local) interactively. 

Brevity 

gardless of how inexpensive available memory has become, the requirements of users 
/e grown to make the compactness of the representation a critical issue. We intend 
brevity attribute to reflect the size of the model, the amenability of the representation 
various compression schemes and the ease with which modelling operations can be 
■formed on the compressed representation. 

Analysability 

alysability denotes the amenability of the representation to the following tasks in the 
dgn cycle: Computation/ex traction of physical parameters from the model to validate 
iinst specifications, functional analysis of the model (using finite element methods for 
rniple), and the visualization of the results of the analysis. 

> Reconstructibility 

constructibility is the attribute that qualifies the ease with which volume representation 
1 be obtained from a physical part, using any one of several measurement modalities 
g. range sensors, CAT scans, MRI scans etc.), or from a different volume representation 
the same part. 

.0 Physical realisability 

e usefulness of any volume representation scheme is considerably enhanced if a volume 
)del can be used to drive a fabrication process. In addition, the model may be used 
determine various attributes of alternate fabrication processes. An example of such an 
ribute is the time to machine a part. Physical realisability is the attribute of a volume 
)resentation that denotes these capabilities. 


Volume modelling: The state of the art 

this paper we examine four dominant modelling schemes, namely the boundary-repre- 
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Figure 1. (a) Boundary-representation (B-rep) model. 


The Boundary-Representation (B-Rep) scheme is a natural extension of the topologi¬ 
cal description of rigid solids, using complex data structures to represent vertices, edges 
and faces of a solid along with the topological constraints (such as adjacencies) and an 
elaborate system of pointers. A delicate balance has to be struck in B-Rep modellers to 
balance between efficiency of modelling operations and fast response to inteiTogations of ^ 
the model usually implemented via redundant access paths. A thorny issue for B-Reps 
has remained the issue of robustness. Geometric computations and topological constraints 
in the framework of a redundant data structure can often lead to internal inconsisten¬ 
cies in the reoresentation and to svstem r.rasbp.R PnmTnprriallv Jivnilnhlp. R-Bp.n RvstP.mS 
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popularized B-Rep modellers was the BUILD modeller, originally developed by IC Braid 
at Cambridge in 1979. 

Constructive Solid Geometry (CSG) Modellers build representations of complex rigid 
objects by applying unary and binary operators such as set union, intersection, difference 
to elementary volumetric primitives such as blocks, cylinders, cones, spheres and tori. The 
representation is usually schematically shown as a tree with the root representing the solid 
and the leaves representing the primitives. The CSG modelling scheme was proposed and 
implemented (PADL-2) by a dedicated group of researchers at the University of Rochester 
(headed by H B Voelcker and A A G Requicha) in the early 1980’s. The CSG representation 
is remarkable for its simplicity and succinctness and has therefore played an important 
role in the design of user interfaces for solid modellers. However, it is inadequate on some 
counts such as ease of rendering and analysis. 

The Non-Manifold Representation was developed as an attempt to meet the need for a 
scheme in which the transition between ID, 2D and 3D entities can be seamlessly executed. 
Dangling edges and faces, solids touching at vertices and edges and in the middle of faces, 
and other such irregular geometric constructs have to be valid in such a representation. 
Modellers built around such representations would be closed under general Boolean set 
operators and general sweep operators. Weiler (1988) proposed a radial edge data structure 
that could support such a representation. The approach that Weiler took was to extend 
the traditional B-Rep structure to admit non-manifold representations. SHAPES (XOX 
Corporation 1995) is a commercial non-manifold modeller developed and maintained by 
the XOX Corporation. 

Voxel modellers. Spatial Occupancy Enumeration or Cellular Modellers use a very 
simple representation scheme. A volume of space (containing the object) is divided into 
a large number of cuboidal cells. The system then labels each cell as to whether or not 
it is occupied by material. The obvious data structure for this representation is a 3D 
array of O’s and I’s. More sophisticated data structures based on quadtree and octree 
representations have been implemented in the popular modellers TIPS-1 (built by Okino 
and others at Hokkaido University) and INSIGHT (a molecular modelling system built by 
Phoenix Data Systems). Pratt (1994) in his review of solid modelling, states that,“... most 
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d modelers based on a purely cellular approach are still in the research domain. This 
ears to be because a satisfactory trade-off has yet to be found between the requirement 
accurate boundary resolution needed in engineering applications and the very large 
•age requirements that this implies.” 

lequicha (1980) published a seminal study of solid modelling representation schemes, 
nore recent update of this work is by Pratt (1994). The interested reader is directed 
hese papers for more details on the individual schemes. The above studies focus on 
comparison of the B-Rep and CSG solid modelling schemes, while dismissing the 
:el-based modelling schemes as being impractical. 

n this paper we re-evaluate these three schemes with reference to the desiderata listed 
ow in table 1. As mentioned above, the non-manifold modelling paradigm evolved as 
extension of the B-Rep representation scheme. However, in principle, non-manifold 
dellers need not follow only this line of development. Hence we evaluate non-manifold 
resentation schemes as separate from B-Rep of solids as also indicated in table 1. 

\ representation that flags geometric features of a part that helps identify manufacturing 
cesses that match with a particular feature will enjoy tremendous increases in efficiency 
;he CAD system built around it. Such hierarchical representation schemes have come 
3e feature-based. The need for them grew out of the fact that B-Rep and CSG 
resentations are too low-level for a designer to work with. Feature-based schemes are 
s an expedient interface between CAD systems today and analysis or manufacturing 
)lications (Rossignac 1990). We do not consider this scheme in our comparative study 
v'olume representations since feature-based models exist at a higher level and can, in 
mciple, be equally well-integrated with any of the B-Rep, CSG, Voxel or Non-Manifold 
ume representation schemes. 

fable 1 summarizes our evaluation of the various schemes for volume modelling via a 
ng of each scheme on each of the ten attributes in our list of desiderata. The following 
brief remarks on the ratings given in the table. The rating of voxel models will be 
cussed in some detail in the sections that follow. 

fhe boundary representation scores high on renderability, morphological dexterity, ac- 
acy, editability (local and global), analysability, reconstructibility and physical realis- 
lity. The detailed nature of this representation, with explicit access of vertex, edge and 
e components, permits efficient data structures and computations on B-Reps. The ability 


»le 1. Attributes of the dominant modelling schemes. 
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in principle to accommodate free-form bounding surfaces explains the high accuracy and 
morphological dexterity of this representation. On the other hand, the B-Rep scheme is 
verbose and has no provision to access volume elements in the interior of objects, and 
hence this representation scores low on the counts of brevity and heterogeneity. 

Constructive solid geometry representations fare somewhat poorly in our evaluation. 
The only count on which we favour this representation is its brevity. This representation 
played an important role in the early development of solid modelling because it permit¬ 
ted approximate representation of complex shapes via regularized Boolean operations on 
simple primitives. However, the indirect and implicit form of this representation makes it 
cumbersome for interrogations that involve structure of the model. 

Non-manifold models have even more detail explicitly given in their representation than 
B-Rep models. Hence they enjoy all the advantages of B-Rep models except on the count 
of physical realisability. The non-manifold representation allows features such as dangling 
edges and faces which are hai*d to fabricate. We rate non-manifold representations a little 
higher than B-Reps on global editability because of access to volume elements. On the neg¬ 
ative side, non-manifold models suffer from low ratings on brevity and heterogeneity just as 
B-Reps do. In principle, because of access to volume elements, it is possible to work some 
heterogeneity into non-manifold models but this seems to involve considerable overheads. 

3. Voxel modelling in emerging manufacturing technologies 

In this section we focus on the attributes in which the Voxel methods outperform the other 
modelling schemes as shown in table 1. We believe that it is this set of attributes which 
can be exploited to tackle the challenging problems encountered by newer manufacturing 
technologies. 

3.1 Voxel modelling for virtual prototyping 

“Virtual prototyping is the process of design analysis, simulation and testing of a product 
within the computer and use of the results to refine the product before making a physical 
prototype” (Kumar et al 1996). 

The excellent morphological dexterity of voxel models combined with their excellent 
local editability imply that CAD tools built using voxel models will allow arbitrary shape 
manipulation and sculpting. Such expectations have been fulfilled by interactive voxel 
modelling tools currently under development (Ravi 1996; Sethia 1996). 

In VoxLab (Ravi 1996), it is possible to edit objects at the voxel level as well as use 
global operations to modify the volume structure of the object. 

Interactive sculpting is the process by which a designer can impose arbitrary free-form 
shape changes on the object being designed. Considerable work has been done in the field 
of surface-based sculpting, which can be broadly classified into space deformation and 
surface fitting. Terzopoulos & Qin (1994,1995) have developed dynamic NURBS and dy¬ 
namic triangular B-Spline models which are physics-based models. These models are used 
for interactively sculpting curves and surfaces by applying simulated forces and local and 
global shape constraints. Pasko & Savchenko (1995) have approached the problem of inter¬ 
active sculpting as the local deformation of a constructive solid by a set of arbitrary points 
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t belong to the resultant siuface. Rappoport et al (1995) have developed a method for 
delling an object which is composed of several tensor product solids, while preserving 
desired volume of each primitive. Bon'el & Rappoport (1994) have proposed a model 
producing controlled spatial deformations. In their model, the user defines a set of con- 
l points, giving a desired displacement, and radius of influence of each. Each constraint 
nt, determines a local B-Spline basis function centred at the control point, and falling 
?:ero for points beyond the radius. Rappoport et al (1994) have introduced the concept 
soft constraints which need not be met exactly, thus increasing the design space. Fi- 
ly, McCracken & Joy (1996) have proposed a new free form deformation technique that 
leralizes previous methods by allowing 3D deformation lattices of arbitrary topology, 
rhe main drawbacks of these methods are that they are object-complexity sensitive, 
y do not operate on the object being sculpted directly, and it is very hard to modify the 
loiogy of the object being designed. 

Voxel-based sculpting is proving its promise in our on-going work. We use Minkowski 
orations to sculpt voxel blocks using tools of arbitrary shape which are themselves voxel 
(dels (Sethia 1996). An object created by such sculpting operations is shown in figure 2. 
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Mechanical analysis is an integral part of the design process since the physic 
properties of the parts determine if a particular design is viable or not. Traditional CA 
tools for analysis use the finite element method (FEM) to compute various material prope 
ties like structural strength, thermal conductivity etc. on the geometric representation of tl 
part. The FEM methodology is essentially a technique for computing discrete approxim 
tions of continuous physical quantities. For B-Rep or CSG tree models the discretisatii 
required by FEM is realized by tetrahedralisation of the design volume. A major coi 
putational issue in this context is the maintenance of non-degenerate tetrahedralisatii 
(ensuring connectivity, eliminating hanging edges etc.). These problems do not arise ir 
voxel model since the basic unit of a voxel model is a volume-filling solid, namely, t 
voxel. Thus, accurate FEM discretisations can be easily extracted from a voxel-model 1 
analysis. 

The visualisation of the results of the analysis is facilitated by volume models; dire 
volume rendering techniques can be used to render the results of the analysis volumetrical 
superimposed on the voxel-model. Interactive volume visualisation tools allow the user 
view the results along any cutting plane of the model, or as a composite whole. 

Several problems that are difficult using B-Rep or CSG or non-manifold approach 
have a simple solution using voxel models. These include estimation of mass propertii 
interference detection, tolerancing and implementation of CSG operations. 

Tolerancing is an important aspect of mechanical design (see ElMaraghy et al 1994, i 
example). A voxel-based modeller can handle this aspect of the design as follows. T 
intersection of two objects due to the variation in tolerance is visualized as a 3D interferer 
volume itself. The interference volume is a measure of the extent of interference, a 
therefore a measure of the “out-of-tolerance” or deviation of the object. The tolerat 
values can be reassigned, and a decreasing volume of intersection indicates a correct tre 
of tolerance alteration. 

The use of interference volumes can be extended to solve the problem of assemblabil 
and disassemblability testing. Other modelling schemes require complex geometric int 
section algorithms and may also suffer from numerical inaccuracies. With voxel mode 
the interference can be precisely quantified and accurately located. 

Combining the good analysability of voxel models with global and local editability 
is possible to design objects with excellent shape optimization. A naive approach is i 
following; Map the results of the engineering analysis to the voxel model. Use a glol 
filter that removes voxels that do not have any load on them, after ensuring that the 
voxels are not essential for maintaining the structural integrity of the part. An exam] 
of a shape-optimizing structure is a honeycomb mesh that can replace solid interiors 
locations on the part that are not subjected to any loads. This will reduce the overall weij 
of the object as well as result in material saving. The morphological dexterity of vo: 
models permits more sophisticated optimizations that will be impossible on B-Rep or C) 
models. 

3.2 Voxel modelling for physical prototyping 

In spite of the tremendous strides made in virtual prototyping technologies, physi 
prototypes are indispensable in the concurrent engineering process. Hence attentior 
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ig directed at making these physical prototypes as rapidly as possible. Traditional 
totyping processes range from wooden mockups, casting from custom patterns, ma- 
ling, and the use of sophisticated NC machines that are directly programmed from 
CAD model of the part. Rapid prototyping, the ability to produce a prototype very 
y in the design cycle, even before the tooling for the manufacture of the component 
been designed, has emerged as a critical technological component of concurrent engi- 
ring. Recently, several new manufacturing technologies that support rapid prototyping 
e become available both in research laboratories and in the commercial marketplace, 
ise technologies are variously called layered manufacturing, additive manufacturing, 
stereolithography (Jacobs 1992). These manufacturing techniques allow a part, or a 
totype, or a tool to be built by the gradual addition of material in a controlled way. In 
trast, traditional manufacturing methods depend on the the removal of material (milling, 
ling etc.) or on the deformation of material (casting, moulding etc). The field of rapid 
totyping is making rapid strides (Bums 1993). 

i-PTs provide the means to manufacture parts that cannot be realized by conventional 
;hods (Kruth 1992). Arbitrary shapes, composite materials, and complicated geometries 
be as easily manufactured as simpler geometries. In order to exploit these capabilities, 
modelling schemes used by CAD tools to design parts should be capable of handling 
inge of shapes including non-manifold objects. Thus, non-manifold schemes (XOX 
poration 1995) and voxel-modelling are the only schemes that have this capability, 
lowever, for converting a virtual prototype into a physical prototype using RPT tech- 
ogy, the current practice is to convert all models into the stereolithography format (STL 
nat) that has emerged as the de facto standard. The model in STL format, basically 
D tiiangulation, is then input to a slicing process (that is unique to each of the lay- 
1 manufacturing equipment) that generates a sequence of slices that drive the material 
ition process. 

^hese two steps, namely, converting into a STL format and the slicing of the STL format 
oduce changes in the model that are beyond the control of the part designer. 

Jsing a voxel-based modelling scheme it is possible to completely eliminate the above 
» steps and instead drive the fabrication process directly. This is facilitated by the close 
respondence between voxel models and the fabrication processes. 

"he current range of rapid prototyping machines can be broadly classified into two major 
jgories based on the way in which material is added to an object under construction. 

Parallel/image-based: The most prominent system under this class is the Solider MPM 
ystem of Cubital, which is a commercially available system. A well-known research 
ystem under this category is the MD* (Weiss & Prinz 1991). In these systems, successive 
ayers of the component under fabrication are created by the use of masks that either 
Ilow a light source to solidify a photopolymer under the exposed regions, or spray 
aaterial on the exposed parts of the mask. The advantage of this approach is that the 
ime for creating a complete layer does not depend on its geometric complexity. Each 
aask is simply a slice of the object, and can be thought of as the image of the object’s 
ross-section. 
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this categoiy. In these systems, a layer is formed by the sequential formation (deposition, 

or solidification) of the contours in the object’s cross-section. Hatching or filling-in 

operation is needed to obtain solid interiors. 

Some RPTs, still in the research stage, like the ballistic particle manufacturing, and the 
beam interference solidification process, create parts by solidifying individual voxels. 

Even some layered manufacturing techniques like stereolithography and selective laser 
sintering use point-by-point solidification to grow each layer. Thus these processes work 
one level below the layer-by-layer manufacturing techniques and this involves the conver¬ 
sion of layer information to voxel information, usually performed by the manufacturing 
equipment itself. 

At every stage in the design process, the description of the part is directly the description 
required by the manufacturing process. Thus it is easy to evaluate the design with respect 
to its manufacturability. 

Several cuiTent and proposed RPTs generate parts by adding material one layer at a time. 
The MD* process (Weiss & Prinz 1991) uses this approach using photographic masks and a 
vapour deposition process that is very similar to the technology used for VLSI fabrication, 
However, the input to the manufacturing device is currently obtained by a postprocessing 
step after the design of the part is completed. This step involves the generation of slices of 
the finished part, the number of slices generated depending on the layer resolution of the 
RPT. The hardware itself is neutral to the CAD tools or primitives that have been used in 
the design of the part. However, the use of traditional CAD tools that use either B-Rep or 
CSC as the basis will imply that the postprocessing step is left to deal with problems like 
the proper orientation of the part before slicing for optimal manufacturability (minimum 
use of support material, protection of the structural integrity of the part at every stage of 
the growth process etc.). 

STL (Stereolithography list) was developed by 3D System for their stereolithography 
machine and has become a de-facto standard input format for several RPTs (Jacobs 1992). 
This PHIGS-based format approximates the part’s inner and outer surfaces by a set of 
triangular plane patches. Each patch is described by its three vertices and a normal vector 
pointing out of the material. 

Though it is possible to generate STL from voxel models to conform to existing RPTs, 
the voxel models permit generation of outputs in formats more closely matched to the target 
RPT. For instance, if the RPT uses a layer-by-layer approach, the voxel-based modeller 
can output the layer information after taking into account several process-planning steps, 
even during the design phase. This will eliminate the need for any processing by the RPT. 

Composites: Research in LM technologies will make it feasible to fabricate components 
with several constituent materials that are physically located to ensure optimum perfor¬ 
mance. The extrusion process used by the Fused deposition modelling (FDM) of Stratasys 
(Bums 1993) can use polymers as well as metals of low melting point like tin. The MD* 
process of CMU uses a robotic arm to spray different metals over masks. It is very likely 
that in the near future, LM technology will mature to a point where multiple materials can 
be used to fabricate a single component. 

The ease with whichheteroaeneitv can bchandlprl hv vhypI mnHplc pnahlpQ Hpsianp.rsto 
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Conventional design tools are not oriented to the design of composite objects. Specialized 
tools are used in areas like the aircraft industry where composite materials play a major 
role. 

A voxel-based modeller can ultimately provide the capability to design a composite 
object with materials selectively placed at individual voxels. There is no need to compute 
the complex geometries of the interleaved materials since, for fabrication, each slice of 
the voxel buffer can be directly read out and several masks per layer can be created to 
deposit the different materials. Such capabilities will be indispensable as the technology 
of microelectromechanical systems (see Senturia etal 1992, for example) matures: A voxel 
will then be of molecular dimensions. 

Inspection of the fabricated part to ensure its compliance with specifications is an 
important aspect of manufacturing. A voxel-based approach has been successfully used to 
implement an on-line automated visual inspection system (Tarbox & Gottschlich 1995). 

Fabrication of components with embedded electronics will be an increasingly com¬ 
mon requirement. The design of wearable computers is one such example that requires 
rapid product development (due to the rate of change of the component technologies) and 
prototyping. A simple approach that exploits volume models in combination with layered 
manufacturing technology is as follows: Volume-scan the electronic subunit, combine with 
a voxel model to give an integrated model of the component and insert the electronic unit at 
a suitable time during fabrication. Implementation of this basic process requires advances 
in both CAD tools and layered manufacturing tools. 

Chandru et al (1995) elaborate on the above advantages of using voxel models for rapid 
prototyping. 

3.3 Voxel modelling for reverse engineering 

Voxel models have excellent reconstructibility. This implies the following. 

o It is easy to obtain the voxel model from a physical part. In practice, CAT and MRI 
scans directly give a three-dimensional voxel array of the model while range sensors or 
3D scanners provide a cloud of points from which a voxel model can be obtained. 

• It is easy to convert models from other representations to voxel models. This can be 
achieved using a class of algorithms called voxelisation algorithms (Wang & Kaufman 
1993) that convert geometric primitives into an array of voxels. 

Reverse engineering is the process of construction of a CAD model from a physical part. 
Measurements are made on the physical prototype (typically by scanning) and these are 
then processed to obtain the CAD model. Thus voxel models are ideally suited to support 
reverse engineering. 

In addition to the traditional reverse engineering task of obtaining a computer model 
from a physical prototype, the voxel-based approach leads to other possibilities. Reverse 
engineering in conjunction with the voxel-based modelling and analysis provides a means 
of reengineering a part from a sample physical model. 

We are investigating the use of such reverse engineering in conjunction with rapid 
prototyping equipment to provide a platform for rapid product innovation and design 



modelling and volume-error measures. 


4. Research issues in voxel modelling 

Table 1 lists some attributes for which voxel models compare poorly with the other models. 
These are precisely the attributes that have been the bottlenecks to the widespread use of 
voxel models. In this section we address each of these attributes and indicate research 
directions aimed at overcoming the limitations. 


Rendering: Current graphics systems are predominantly polygon-based. Recent desktop 
workstations, even at the entry level, are capable of rendering close to a million triangles 
per second. Similar hardware advances are yet to take place for volume rendering and 
hence rendering of reasonable voxel volumes (say 512x512x512) takes a few seconds. 
However, various ways to speed up volume rendering are being vigorously pursued. These 
include the use of texture mapping hardware to perform back-to-front composition of 
volume slices (Fraser 1994), the development of special-purpose hardware for volume I 
rendering (Pfister et al 1995), and the use of distributed and shared memory workstations. ; 

Global editing: Voxel modelling has excellent local editability and certain global operations { 
(analogous to filtering) that alter the entire model can be implemented witli ease. However, | 
they suffer from a lack of information about the component structures of an object. For | 
instance, a voxel model of a table will allow one to edit the sharp comers into rounded i 
corners, but it will be difficult to replace the four legs of the table with legs of different : 
shape. This is because there is no notion of a substructure in the basic voxel models. I 
Research into hierarchical models that permit such substmcturing, without disturbing 
the other advantages of voxel models may be thought of as feature-based modelling using 
the voxel as the basis. This appears to be a fruitful research direction. ^ 

Brevity: A serious practical problem with voxel models is their poor brevity. A regular voxel ; 
array to represent a component with a 5 cm bounding box will require a voxel resolution | 
of about 400^ to realize a physical part with an accuracy of 125 microns. | 

Clearly, this is another reason for hierarchical and adaptive representations. Octrees, I 
BSP trees and similar data stmctures combined with algorithms that effectively operate 
on these structures is a critical area of research. Another facet is that of compression { 
algorithms and algorithms that operate on compressed data stmctures. 


Accuracy: The accuracy of voxel models is inversely related to the brevity of the represen¬ 
tation. Since by definition a voxel representation is a discretization of continuous geometric 
entities, the accuracy of the representation will be poorer when compared to CSG or B-Rep. 
Aliasing errors due to the discretization process will be present. However, the redeeming 
aspect of voxel models is the possibility of exact quantification of the error as well as user 
control of the error. Research into the application of anti-aliasing techniques of computer 
graphics to voxel models and the use of volumetric error measures is a promising area of 
research (Prakash & Manohar 1995). 
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Abstract. We present a framework based on Domain Independent Forai (DIF) 
features for automatic evaluation of manufacturability and process planning for 
machining. The framework enables interpretation of a common product model 
with respect to each task in the transition from design to manufacture. A key 
idea here is to generate the interpretation suitable for each task in two steps. 
In the first step, DIF features that are defined through feature enumeration are 
automatically extracted from the geometric model. The extracted DIF features 
are then mapped into features meaningful for individual tasks through geometric 
reasoning based on domain dependent knowledge. The formal approach to 
feature definitions and separation of the domain specific reasoning from the 
general geometric reasoning enable us to overcome the bottlenecks reported in 
features technology. 

Keywords. Feature-based manufacturing; feature extraction; feature map¬ 
ping; visibility; process planning; manufacturability evaluation. 


Introduction 

5 concept of feature evolved from the desire to devise methods for integration of part 
metric model with applications such as process planning, group technology coding and 
nerically controlled (NC) tool path programming. Features are high level abstractions 
)art geometry such as holes, slots and ribs around which engineering knowledge and 
ertise is stmctured. Hence reasoning about part geometry in applications such as process 
ming becomes easier to automate when part description becomes available in terms of 
tures rather than basic topological entities (face, edge and vertex) that are available in 
geometric model. These features are referred to sls form features as they describe the 
m and shape of a part. 

rhe motivation for integrating part geometric model (also referred to as the CAD model) 
h applications such as planning for manufacture, comes from the need to compress prod- 
development lead times and to reduce costs. The sequential nature of product design 
i the necessity of human intervention in the transition between individual tasks have 
in responsible for large lead times and costs in the conventional product development 
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cycle (Bakerjian 1992). By allowing downstream lifecycle concerns to influence product 
design during the design process itself, delays and costs due to the sequential nature of the 
development process can be reduced considerably, if not eliminated. In the context of prod¬ 
ucts manufactured by machining, this would imply ensuring that the design is machinable 
during the design process itself. Eliminating human intervention in the transition between 
tasks involves automating the geometric reasoning involved in interpreting part geometry 
appropriate to the task domain and the process-based reasoning to realise the task. 

Automation of evaluation of manufacturability and planning of manufacture requires 
that part information be available at different levels of abstraction and detail. For instance 
the task of generating a process plan involves sequencing of machining instructions (gross 
process plan) and generation of NC tool path (fine process plan). The first requires infor¬ 
mation on entities such as slot, step or hole, whilst the second also needs the geometric 
details such as surface normals and locations. The domain knowledge/expertise of each 
task in product development is structured around features. Hence it is essential to support 
features to automate the reasoning involved in evaluation of a design with respect to a task. 

There have been three routes followed by researchers to realize feature information to 
reason about process (Shah 1991). These are as below. 






• Interactive feature definition: In this approach, features are defined by the users by 
picking entities, associated with each feature, from a CAD model. This approach has been 
used mainly to input data to process planning and tool path generation programs (Chang 
&Wyskl985). 

• Feature-based modelling (FBM): In this approach the feature information is incorpo¬ 
rated in the model construction stage itself. This is achieved by using features to build the 
model (Salomons et al 1993). This method of modelling is a generalisation of the ‘de¬ 
sign by features’ approach. In the ‘design by features’ approach only design features are 
used, whereas in FBM, features of interest to any one specific application can be used. In 
most implementations, feature-based modelling is based on Constructive Solid Geometry 
(CSG), where the features are available as primitives in addition to the regular primitives 
such as cylinder and block. Commands of a solid modeller to create the feature are gen¬ 
erated based on the feature selected. Arbab (1992) modelled parts by removing volumes 
corresponding to machining features from the stock solid. Part feature model thus obtained 
was used to link design with process planning. Shah (1988) implemented a general purpose 
feature-based modelling system. This system allows the user to define a set of features 
along with all its manipulation functions. 

• Automatic feature extraction: In this approach, form features of interest to an appli¬ 
cation are automatically extracted from the geometric model of a part. Automatic feature 
extraction has fundamental significance from the integration perspective of product devel¬ 
opment cycle. Reasoning involved in the downstream applications cannot be automated 
without correct interpretation of part design with respect to each task. Since most of the 
established application knoweldge is based on human interpretation of part geometry, it is 
essential to generate human-like interpretation for complete automation. 

Automatic extraction of features from a geometric model is a difficult problem. Ap- 
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have been proposed to solve this problem. These vary from syntactic pattern recogni¬ 
tion techniques (Kyprianou 1980), graph matching (Joshi & Chang 1988), and volume- 
decomposition (Shah etal 1994; Tseng & Joshi 1994; Sakurai & Dave 1996) to hint-based 
techniques (Vandenbrande & Requicha 1993). The reader is refen'ed to Shah (1991) and 
Subrahmanyam & Wozny (1995) for reviews of the literature on feature recognition. 

None of the commercial CAD/CAM/CAE software that claim to be feature-based can 
integrate design with manufacturing, analysis with other downstream tasks. Most of these 
software support only the FBM approach. In this approach a task-specific model can be 
obtained only by restricting the user to models using a particular set of features. For other 
tasks, the part has to be modelled differently and hence both data integrity and development 
time are adversely affected. 

On the other hand, feature recognition approach to generate feature models is only now 
beginning to see commercial development (CAD 1997). Most of the techniques reported 
in literature have limited scope and their extension to handle real world part complexity 
is not proven. Implementations reported so far consider only one application (process 
planning for machining) and domain specific heuristics are used to recognise interacting 
features (Subrahmanyam & Wozny 1995). The most formal and general approach is graph 
matching, but sub-graph isomorphism is in NP-complete list. This approach also suffers 
from combinatorial explosion problem as sub-graphs of all feature instances have to be 
defined in advance and checked for. Techniques based on decomposition (Shah et al 1994; 
Tseng & Joshi 1994; Sakurai & Dave 1996) are proving to be more successful than graph- 
based approaches in handling feature interactions, at least for simple cases. 

In the proposed framework, domain-independent form (DIF) features are used as the 
intermediate representation. The term ‘domain-independent’ is used to imply that the 
definitions and classification of the features are based purely on form (geometi 7 and 
topology) and not on considerations arising from various application domains such as 
machining, near-net shape processing, analysis, or assembly. The DIF feature set is then 
used to obtain a feature set corresponding to other application domains through a process 
called feature mapping. In the following we first present the feature-based framework for 
manufacturing (in particular, machining). Definitions for DIF features are presented next. 
We then highlight some of the results in feature extraction, mapping and manufacturability 
evaluation. We conclude with some open issues in this area, 

2. Feature-based manufacturing framework 

A product typically consists of several electrical, electronic and mechanical parts. The 
framework proposed addresses only the development of discrete mechanical parts. Part 
development involves more than one task, and even in a single task part data may have to 
be interpreted in more than one way. For example, if a part has to be manufactured through 
machining, then it has to be considered for process planning, which in turn consists of 
several sub-tasks such as stock selection, fixturing, sequencing, tool selection, NC tool 
path generation and final inspection. 

Generating part interpretations suitable to each task through FBM approach would in¬ 
volve modelling the same part differently for each task. This would adversely affect the 
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Figure 1. Feature-based machining system. 


concurrency and development time. On the other hand, implementing feature extraction 
specific to each application involves bringing down the level of abstraction of each applica¬ 
tion to the level of basic geometry, which is a difficult task. Moreover, automatic handling 
of non-geometric data which can only be attributed to features is not clear at present in the 
feature recognition approach. 

The proposed framework is designed such that feature-based modelling (FBM) and fea¬ 
ture extraction approaches complement each other (see figure 1). Migration from geomet¬ 
ric modelling to product modelling is effected through providing features for modelling. 
Product data which are difficult to attribute to basic geometric entities can be attributed to 
features. Unlike the conventional FBM approach, the designer is not restricted to use only 
features for modelling and can use boolean operations over primitives along with features. 
As features get modified in the process of modelling, the design model does not corre¬ 
spond to the feature model of the final solid. In the proposed approach, part geometry is 
represented as boundary representation (B-rep). Product data are represented as attributes 
to entities in the design model and geometric model. 

Design model is an unevaluated representation of the design process. Reasoning over 
design model to generate task-specific interpretations is difficult because of its subjectivity 
(a designer’s viewpoint) and non-uniqueness. Hence the design model is referred only for 
product data in the mapping process. From the evaluated geometric model (B-rep), DIF 
feature model is created through automatic feature extraction. Then the extracted DIF 
features are mapped into each specific task. DIF feature interpretation is at a higher level 
of abstraction than the basic geometry. Hence, knowledge-based reasoning involved in 
mapping can be implemented without bringing down the domain knowledge of each task 
to the level of basic geometry. 

We have taken a general and formal approach to DIF feature classification and defi¬ 
nitions. Based on this formalisation, problems in feature recognition and feature-based 
modelling are treated generally without using any one domain specific heuristic. All the 
task specific issues are addressed in feature mapping and subsequent process specific mod¬ 
ules, process planning and NC tool path generation (figure 1), where domain knowledge 
interacts with the part’s DIF interpretation. Process-based reasoning has been divided into 
three modules, as the interaction of process and part geometry is of different degrees in 
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h of the modules and the nature of reasoning is also different. For example, in NC 
1 path generation the task primarily involves geometric computations, once the cutting 
ameters are fixed in the planning module. 

n this paper, we present a feature-based approach for the planning module where the 
ective has been to use geometric reasoning to provide plans that can then be pruned by 
istraints from the process or processing environment. This is illustrated in the schematic 
ftgure 2. The output from the mapping module of figure 1, machining features, forms 
input to this module. Module V_Map uses visibility checks on the faces of each feature 
generate access directions for machining the feature. A global accessibility check along 
'feasible set of the access directions is then used to evaluate manufacturability of the part 
I if the part is manufacturable, to generate all possible sequences for machining the part, 
is sequence tree is then pruned by subjecting it to constraints from the machining domain 
:h as set-up optimisation, handling datums/references, existing machining practice to 
;ain one or more machining sequence. The remainder of the paper explains this approach 
i provides some results. 


Domain-independent form features 

th feature extraction and feature-based modelling require a set of well-defined form 
hires. A feature has to be defined either interactively or through a language before it 
1 be used either by feature recognition or by feature-based modelling modules. Form 
tures unique to certain applications such as twists, louvres, tabs of sheet metal, and 
:ondary features such as knurl, teeth and threads interact with the existing part geometry 
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Figure 3. Sweep structure and face classification for DIF features. 


depends on the application, context, and interactions. Hence we focus on classificati 
extraction, mapping and manipulation of primary (non-unique) features. 

The conventional approach (Butterfield et al 1986) is first to collect form features 
interest to a particular application. Collected features are then organised through clai 
fication into hierarchies and feature definitions are derived. In this approach, the num 
of form features collected and the criteria used for classification are subjective. Hei 
finiteness and completeness of the defined feature set cannot be established with resp 
to possible variation of topology and geometry in designed components. Unlike ear 
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)roaches we have defined features through enumeration of possible topological varia- 
is in designed solids. 

Ne have modelled form feature-generation as subtraction/addition of a solid-piece 
m/to a base.solid (Rao Nalluri 1994). The solid on which the feature is created is 
sired as the base_solid and the minimum solid that is required to create the feature 
■eferred as the solid-piece. In the generation of a feature, some existing faces of the 
;e_solid are modified and some new faces are created. The modified and the newly 
ated faces are classified as ‘shared’ and ‘created’ faces respectively w.r.t. the generated 
ture. The solid-piece faces can be classified into ‘shell’ and ‘end’ faces, and combining 
two classifications results in four face types: Created Shell Face (CSF), Created End 
:e (CEF), Shared Shell Face (SSF) and Shared End Face (SEF) as shown in figure 3. 
s geometry and arrangement of ‘created’ faces of the feature is defined as the ‘shape’ 
the feature. The connectivity of the feature face set with the rest of the base-Solid is 
ined as ‘type’ of the feature. In the feature generation model, keeping the variations 
ich affect the ‘shape’ independent, variations which affect ‘type’ are enumerated and 
leric feature definitions are derived in terms of the four face types as given in table 1. 


Direct manipulation through form features 

iture-based modelling (FBM) provides an intuitive design environment and enables at- 
mtion of product data to the features. The common approach to implement FBM system 
o have features supported only at the user interface. For all manipulations of geomet- 
representation, features are converted into solid primitives and boolean operations, 
contrast to earlier approaches we have developed algorithms for direct manipulation 
10 Nalluri & Gurumoorthy 1993a) of boundary representation (B-rep) through fea- 
es. Feature generic ‘type’ dictates the manipulation involved in creation and deletion 
a feature instance. All feature-based editing of B-rep are effected through deletion of 
old feature and creation of a modified feature. Feature relations are caotured throueh 
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shared/created face classification. Automatic adjustment of features in editing is effec 
based on the captured feature relations. Manipulation of B-rep directly with featu 
through computationally efficient incremental upgradation distinguishes our work fr 
other efforts in this area. 

5. Extraction of DIF features 

Extraction of DIF features involves decomposing of the input solid into a set of instan 
of enumerated generic form features. We have modelled the feature extraction process < 
reverse process of feature generation. In the process of feature generation the topologi 
complexity of the base_solid increases. Hence in the process of feature extraction part sc 
should get simplified with extraction of each feature. The idea of topological simplic 
is implemented through the concept of Dynamic Topological Status (DTS) (Rao Nal 
1994; Hari 1995). DTS of a face is a set of topological factors about the face which desci 
its topology. Since features are created in a certain order there is an order in their extract! 
A feature gets extracted when all its faces are only ‘created’ faces and not the ‘shar 
faces of any other features. ‘Created’ faces have simple topology compared to ‘shar 
faces. DTS values of faces enable clustering of ‘created’ faces of a feature and also ^ 
clues about the probable ‘type’ of the feature. The probable feature ‘type’ is confim 
based on the enumerated feature definitions. For example, the clue for a blind.slot feat 
is a face with all 4-edged neighbours except one corresponding to single shared shell f 
(SSF). The confirmation procedure for feature type blind_slot is as given below. Once 
‘type’ is fixed, the feature face set is identified and fixing ‘shape’ becomes a local match 
problem. 

Procedure. BLind-slots(face[i]) 

Feature-Type := NULL 

If (face[i] DTSl ! = 4 and DTS2=1 and DTS3=0) 

End-face l:=face[i] 

, Find the single non-four-edged neighbor of End-face 1 
If (It is of type SSF) 

Find End-face2 

IF(DTS of End-face 1 and End-face2 are IDENTICAL) 

IF(Angle btw End-face 1 and CSF’s is CONVEX) 
Feature-Type; = Double Blind Slot 

ELSE 

Feature-Type:= Double Blind Rib 

ELSE 

IF(Angle btw End-face 1 and CSF’s is CONVEX) 
Feature-Type:= Blind Slot 

ELSE 

Feature-Type: = Blind Rib 

Retum(Feature-Type) 

End Procedure 


me separation or type ana snape in reamre aennitions reaucea me numoer or con¬ 
firmation procedures required from (n? x ris) {ut = number of types, ris = number of 
, shapes) to {ut -\-ns). The complexity of the search depends on the number of faces rather 
“■/ than the number of feature ‘types’ supported. Hence the complexity of feature extraction 
is 0{nf^ loginf)). Since all feature interactions are covered in enumeration for defining 
features, interactions do not require any special heuristic procedures for their resolution. 
The order obtained in the extraction can be used in mapping. 

Inability to deal with interacting and intersecting features in a general way is one of the 
bottlenecks in features technology. We attribute this problem to (1) incompleteness of the 
feature set w.r.t. to topological and geometric variation in parts, (2) attempting to recog¬ 
nise a task specific feature when it is not explicit through relaxation of feature definition 
and application of domain-specific heuristics. The formal approach to feature definitions 
enabled us to define a complete set of feature types that is required to cover the defined 
topological and geometric variation. In the case of intersecting features, as we do not at¬ 
tempt to recognise the task-specific features directly, the most explicit feature is recognised 
^ without any application concern. Later in the mapping, based on the relationships among 
extracted DIF features, application specific interpretation is obtained. 


6. Mapping DIF features to applications 

Mapping involves knowledge-based reasoning along with geometric reasoning. Rela¬ 
tions among extracted DIF features are explicitly represented as Feature Relational Graph 
(FRG) to apply the high level application knowledge. FRG is constructed based on the 
shared/created face classification. A feature X is said to be a child of another feature Y, if 
and only if, at least one shared face of feature X corresponds to a created face of feature Y. 
FRG is an acyclic graph which stores the child/parent relationships among features. Map¬ 
ping modules have access to (1) geometric model, (2) FRG of extracted DIF features, and 
(3) design model. Both (1) and (2) representations are domain-independent, and entities 
in representation and their interpretation are well defined, whereas the design model is a 
designers interpretation and hence is subjective. It is mainly referred to get the task-specific 
product data. These three inteipretations at different levels of abstraction provide all the 
information at the required levels to enable knowledge-based reasoning in mapping. 

The inputs to the mapping module are stock B-rep, part B-rep, and FRG. The feature 
extraction module classifies the DIF features extracted based on shape (rectangular, circu¬ 
lar, I-section etc.), class (hole, slot, step) and type (positive, negative). With reference to 
the feature definition, positive features and negative features are obtained by adding and 
removing material respectively from the base_solid. Negative features without positive 
children map directly into machining domain. Positive features that are child features of a 
negative feature are split into the portion totally inside the negative feature volume and the 
portion outside. The portion outside the negative feature volume is treated as a separate 
positive feature then onwards. The portion of the positive child feature inside the negative 
feature and the negative feature are combined as a single negative machining zone. Positive 
features which are adjacent are clustered and enclosed with a box. Positive feature cluster 
and box together form a positive machining zone. The boxes corresponding to positive 
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machining zones and base_solid make the first stage solid. First, stock solid is reduce 
first stage solid through clipping (Rao Nalluri & Gurumoorthy 1993b; Rao Nalluri 19 
then enclosing box in each positive machining zone is clipped w.r.t. inside feature clu 
In clipping a outside solid w.r.t. an inside solid, the outside solid is split about the fi 
of the inside solid. The results of clipping are convex volumes that are required t( 
subtracted from the outside solid to get the inside solid. After the clipping, all posi 
features and the base_solid are created in the stock solid. As all negative features ha\ 
be either on base_solid or on one of the positive features, they can be machined any I 
after their shared faces are ready. Through clipping all part positive features are conve 
into negative machining features w.r.t. stock. These negative features are added to the 
features list for further processing. Figure 4 shows a part with positive DIF feature anc 
negative features obtained after mapping. 


7. Manufacturability evaluation 

Features extracted from the B-rep of the solid or obtained through the mapping pro 
described above are regions determined only by local topological and geometric cht 
teristics of the part. A visibility-based approach has been developed to find the gl 
attributes (accessibility) of features and determine how this affects the manufacturali 
of the part. An added gain from this approach has been the usefulness of the globa 
tributes in process planning. Local accessibility of a feature is first determined base 
the visibility of faces forming the feature and machining access directions are genen 
The global accessibility of each feature along the machining directions is then asse 
with respect to the part (Aditya Narayan 1995). Based on the global accessibility it 
mation it is possible to enumerate the machining sequences that will realise the part 1 
stock. This enumeration can then be pruned using other constraints such as dimensi 
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Figure 5. Visibility map (VJvlap) of features. 

references, set-up minimisation, tool availability etc. to generate machining process 

cal accessibility of a feature refers to the existence of directions from which all points 
erest on the feature are visible. In the context of feature-based manufacturing, the 
tive is to find the directions from which all the faces of a feature are completely 
e. These are directions in which a tool might be able to approach a feature. 

Jibility maps (VJMaps) (Woo & Chen 1992; Woo 1994) can be used to determine 
►cal accessibility of a feature. A feature is said to be locally accessible if there is a 
lion in which all the created faces of a feature are visible. As a point in the VJMap 
surface denotes a direction from which all the points on the surface are visible, for 
e created faces of the feature to be visible, the VJMaps of the created faces must 
ect. The valid directions of access for the feature will lie within this intersection of 
VJMaps. This region of intersection of the VJMaps of the feature’s created faces is 
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called the V_Map of the feature. Figure 5 shows the VJVlap of two features. For the V- 
shown on top, the V_Map is a finite region (indicated by the dotted line in figure 5). 1 
implies that there are many access directions for the feature. For the dovetail slot (boti 
of figure 5) the intersection of the V_Maps of the created faces are just two points 
are shown circled in the figure. The dovetail slot, therefore can be accessed for machir 
only from the two directions that are normal to the two end faces. 

Valid access directions are those which map to a point on the unit sphere that is abov 
on the planes corresponding to the created faces of the feature. If no VJVlap can be generj 
for a feature, the feature is not locally accessible. Such features are to be handled eithei 
changing the design of the feature to make them locally accessible or by resorting to spe 
manufacturing methods, such as the use of special tools. Parts with such features are 
completely manufacturable. At present such cases of local inaccessibility are flaggei 
the user. 

The feature VJVlap determines the local accessibility of features and is a characteristi 
the feature only. They are sensitive only to orientation of the feature and not to its posii 
in space. The feature VJVlap is a semi-infinite set of directions, from which a finite se 
directions has to be picked for evaluating global accessibility of the feature along tl 
directions, In actual machining practice, the choice of a particular access directioi 
based on heuristics. To generate as many relevant access directions as possible, candic 
directions, based on some common cases encountered in general machining, are tes 
Thus, normals to the created (CEFs and CSFs) and the shared faces (SEFs and SSFs) 
checked with the feature V_Map and are taken as valid access directions if they lie withi 
These directions form the initial set of access directions which will be checked for glc 
accessibility of features. 

7.1 Global accessibility 

Determination of global accessibility is a check of whether a generic tool, representabl 
a line, can reach the feature to machine it. Global accessibility of a feature does not ass 
absolute manufacturability of the feature. It is only a first-step evaluation before a ir 
detailed tool and manufacturing environment specific accessibility analysis (that inclu 
size effects) is done. 

The traditional method of determining global accessibility by ray tracing from the rej 
whose global accessibility is to be determined (Spyridi & Requicha 1990; Vandenbra 
& Requicha 1993; Lim & Menq 1994) is a computationally intensive task if carried 
for the whole feature. Further, ray tracing is susceptible to failure due to bad choice of 
spacing and poor resolution. We define a feature to be globally accessible, along a directi 
if the intersection of the. part or workpiece and the extrusion of the feature volume in i 
direction is NULL i.e. 

(iW nExt(VF)/) = 4> 

where i W = interior of workpiece and Ext( V/r)/ =?= extrusion to infinity of feature volui 
Vf along a direction / (see figure 6). Mill et al (1994) also use a volume-based appro; 
but their method does not yield correct results for directions of access that are not non 
to the assumed feature-machining face or for features such as dovetail slots. 
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Figure 6. Determination of global accessibility of a feature. 


'he premise for using the feature volume to determine feature accessibility is that the 
Die feature volume, which has to be removed for creating a feature, should be accessible 
hie tool. Figures 7 and 8 show the global accessibility of a feature along two directions, 
the direction in figure 7, the volume of intersection is NULL and hence the feature 
lobally accessible. For the direction in figure 8, it is not accessible, as a finite volume 
eturned. In both the figures, the first block shows the feature volume on the part and 
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8. Accessibility and process planning 

Information about global accessibility of features in a part can be used to sequence l 
features for machining. If a feature is globally accessible with respect to the part, tl 
that feature can be created in the stock independent of other features (figure 9). Also, i 
feature is not globally accessible then that feature can be created only after the creation 
a feature from whose volume it is accessible. In figure 10, features B and C are globa 
inaccessible features, whereas A is globally accessible. Features B and C can be crea 
only if they are accessible with respect to a volume which has already been created. Oi 
this precondition is satisfied they can be created in any sequence. Such features can 
said to have secondary accessibility as they are dependent on creation of other features 
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Part 






Interpretation 1 Interpretation 2 

Figure 10. Machining sequence interpretations of globally inaccessible features. 


becoming accessible. This dependency is not known a priori but has to be checked for the 
non-globally accessible features after the creation of every feature. 

These assertions are based only on the consideration of spatial interrelationships between 
features and no other geometric or machinability-related constraints are operative. Based 
on the above, a tree of machining sequences, based on accessibility, can be generated where 
each path from the root to a leaf is a possible machining sequence from the accessibility 
point of view. At the root of the tree is the stock and at each leaf will be the final part if the 
pait is manufacturable. At every level in the sequence tree, a feature will be created. The 
features at the first level are all globally accessible features as they can be created in the 
stock directly. At the other levels, the nodes will be the globally accessible features that 
have not been created at any node from the root to that node and features that gain secondary 
accessibility by the creation of features at ancestor nodes. An enumerated sequence tree 
is shown in figure 11 for a part with three features that are globally accessible and one 
feature that is globally inaccessible/exhibits secondaiy accessibility. 

Features with secondary accessibility can be manufactured only if a tool-machine com¬ 
bination can be found that will reach it by fitting into an already created feature. These 
checks can be built into a tool selection module as constraints. This tree structure gives 
an exhaustive enumeration of all the possible machining sequences based on global ac¬ 
cessibility. This allows the planner to consider all possible sequences in the search for an 
optimal process plan. There have been similar efforts reported (Nau et al 1992; Gupta & 
Nau 1993), that use similar volume-based checks. However, in these efforts, the features 
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Enumerated Sequence Tree 




Globally Accessible Features 
Secondary Accessibility Features 

A, B and D: Globally Accessible Features 

C: Globally Inaccessible/ 

Secondary Accessibility Feature 



Figure 11. Enumerated sequence tree for a sample part. 


and the volumes associated with them are pre-defined. Feasible machining sequences 
be generated from the tree by the application of machining domain constraints sucl 
dimensions, set-up minimisation and machining optimization, to mention a few. 

The authors have not encountered any earlier attempts at interpreting the dimensi 
as constraints on machining sequence during generation of process plans. An attempt 
been made to incorporate the dimensional information associated with the part design 
process planning approaches (Aditya Narayan 1995). 

The effect of specifying dimensional relationships between various entities in the j 
such as abstract data, faces and edges, are mapped to features. The features are sequeu 
for machining based on these relationships. In the present implementation, only com 
tional linear and angular dimensions are handled. The dimension is assumed to be speci 
between two entities of the part, either abstract (e.g. centerline of a hole) or real (e.g. f 
edge). This is a bidirectional relationship in which either entity may be referenced from 
other. No directionality of reference is assumed, though a feature or face may be ‘fon 
to be a reference face by not specifying another entity with it in the dimension struct 






Figure 12. Test part with dimensions. 


le sequence of machining based on dimensional data fulfills only the mandatory di- 
donal conditions specified by the part design. Further optimization of the process may 
irried out by considering the freedom offered in set-up generation, tool change, cost 
nisation etc. Also, constraints on the process will be imposed by tool and machine tool 
Dility, their availability and fixturing. This can be done within the framework of the 
il accessibility tree by evaluating the various sequences that are enumerated by the tree, 
ice the machining features have been sequenced, the volumes corresponding to the 
res can be found (Rao Nalluri et al 1995), Figure 13 shows the machining volumes 
spending to the creation of faces and features in each step of the sequence. Machining 
res provide the logic for selection and sequencing of operations. Machining volumes 
[de the geometric information for generating NC tool paths. The NC path generation 
lie in figure 1 reasons over the machining volumes and generates the tool path (cutter 
ion file) (Gurumoorthy 1996). In essence, once a component is modelled in the proto¬ 
system, NC tool paths required to realise the part from a given stock are automatically 
rated. 


Concluding remarks 

ave presented a framework based on features technology to support manufacturability 
lation and process planning. A prototype system as shown in figures 1 and 2 is under 
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Figure 13. Machining sequence. 
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Figure 14. Machining sequence (contd.). 
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development. Feature-based modelling based on direct editing, feature extraction, feature 
mapping and generation of sequence tree based on global accessibility information and 
automatic tool path generation have already been implemented for polyhedral solids. Direct 
editing of feature-based models and feature extraction have also been implemented for 
exact B-Rep models using the ACIS geometric kernel (Hari 1995; Gupta 1997). Currently 
the focus is on implementing constraints based on good machining practice (as followed 
on the shop floor) into both the evaluation and sequencing tasks. 

In this paper, limitations and bottlenecks in the feature technology have been brought out 
and remedies have been outlined. Defining form features through feature generation model 
is a novel approach which has significant potential for features technology. Separation of 
‘type’ and ‘shape’ reduces the effort involved in implementation of feature extraction and 
feature based modelling. The enumeration of DIF feature types gives a sound basis for 
design and implementation of feature-based systems. By removing (through updating) the 
recognised features, part geometry as seen by the feature extraction module progressively 
simplifies and eliminates problems in feature extraction (interacting/intersecting features) 
that have been reported in the literature. Use of features has been extended to geome¬ 
tric manipulation through development of algorithms for direct manipulation and editing 
of geometric representation. Manipulation of product data in mapping requires further 
investigation. 

It has been shown that issues such as global accessibility of features are vital not only for 
the evaluation of designs for manufacturability but also have an effect on the sequencing 
of machining operations. Currently, the accessibility analysis considers only the spatial 
relationships between features. For process planning, the effect of feature sizes is important 
for the evaluation of machine and tool capability and for optimising machining volume 
removal. Also, a complete tool accessibility check would require evaluating the exact shape 
and size of the tool and the feature. Algorithms to deteimine local and global accessibility 
have to be extended to handle the more general case of features with free-form surfaces. 
The current implementation considers dimensional data and references for sequencing of 
machining operations. Further work could focus on issues such as form tolerances, set¬ 
up minimisation, fixturing, tool and machine tool evaluation and machining practice and 
heuristics. The machining access directions determined based on the VJMap of features 
can be used for setup optimisation. 

Global accessibility of features enable identifying multiple options in machining fea¬ 
tures. Work is underway to identify multiple feature interpretations across domains. For 
the integration of CAD geometric model with application domains to be completely au¬ 
tomatic, redesign (i.e. modification of part geometry by applications) has to be possible. 
Some preliminary results are available in this area (Das et al 1996). The aim of the work 
presented in this paper has been to develop an algorithmic framework for process planning 
that is driven as much as possible by geometry and allows the incorporation of constraints 
and heuristics of the machining domain. It is hoped that further work in using DIF feature.s 
as the unifying representation for geometric and domain specific reasoning will enable cor- 
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Abstract. Scheduling is a key factor for manufacturing productivity. Effective 
scheduling can improve on-time delivery, reduce inventory, cut lead times, and 
improve the utilization of bottleneck resources. Because of the combinatorial 
nature of scheduling problems, it is often difficult to find optimal schedules, 
especially within a limited amount of computation time. Production schedules 
therefore are usually generated by using heuristics in practice. However, it is 
very difficult to evaluate the quality of these schedules, and the consistency of 
performance may also be an issue. 

In this paper, neai-optimal solution methodologies for job shop scheduling 
are examined. The problem is formulated as integer optimization with a “sepa¬ 
rable” structure. The requirement of on-time delivery and low work-in-process 
inventory is modelled as a goal to minimize a weighted part tardiness and earli¬ 
ness penalty function. Lagrangian relaxation is used to decompose the problem 
into individual part subproblems with intuitive appeal. By iteratively solving 
these subproblems and updating the Lagrangian multipliers at the high level, 
near-optimal schedules are obtained with a lower bound provided as a byprod¬ 
uct. This paper reviews a few selected methods for solving subproblems and for 
updating multipliers. Based on the insights obtained, a new algorithm is pre¬ 
sented that combines backward dynamic programming for solving low level 
subproblems and interleaved conjugate gradient method for solving the high 
level problem. The new method significantly improves algorithm convergence 
and solution quality. Numerical testing shows that the method is practical for 
job shop scheduling in industries. 

Keywords. Scheduling; Lagrangian relaxation; dynamic programming. 

Introduction 

eduling is a key factor for manufacturing productivity. Effective scheduling can im- 
/e on-time delivery, reduce inventory, cut lead times, and improve the utilization of 
leneck resources. Because of the combinatorial nature of scheduling problems, it is of- 
iifficultto obtain optimal schedules, especially within a limited amount of computation 
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time. Production schedules therefore are usually generated by using heuristics in practice. 
However, it is very difficult to evaluate the quality of these schedules, and the consistency 
of performance may also be an issue. A logical strategy is thus to pursue methods that 
can consistently generate good schedules with quantifiable quality in a computationally 
efficient manner. 

This paper examines the practical scheduling of job shops, a typical environment for the 
manufacture of low-volume and high-variety parts. In a job shop, parts with various due 
dates and priorities are to be processed on various types of machines. Job shop scheduling 
is to select the machines and beginning times for individual operations to achieve certain 
objective(s) with given machine capacities. In this paper, job shop scheduling is formulated 
as integer optimization with a “separable” structure. The requirement of on-time delivery 
and low work-in-process inventory is modelled as a goal to minimize a weighted part 
tardiness and earliness penalty function. Lagrangian relaxation (LR) is used to decompose 
the problem into individual part subproblems with intuitive appeal. By iteratively solving 
those subproblems and updating the Lagrangian multipliers at the high level, near-optimal 
schedules are obtained with a lower bound provided as a byproduct on tlie optimal cost. This 
paper reviews a few selected methods for solving subproblems and for updating multipliers. 
Based on the insights obtained, a new algorithm is presented that combines “backward” 
dynamic programming (BDP) for solving low level subproblems and interleaved conjugate 
gradient (ICG) method for solving the high level problem. The new method significantly 
improves algorithm convergence and solution quality. Numerical testing shows that the 
method is practical for job shop scheduling in industries. 

1.1 Literature review 

Given the economic and logistical importance of the scheduling problem, many of the 
early efforts centred on obtaining optimal schedules. Two prominent optimization meth¬ 
ods are the branch and bound method (Fisher 1973) and dynamic programming (e.g., 
Pinedo 1995). It was discovered that the generation of optimal schedules often requires 
excessive computation time regardless the methodology. Furthermore, job shop schedul¬ 
ing is among the hardest combinatorial optimization problems and is NP-complete (Garey 
& Johnson 1979). Production schedules therefore are usually generated by experienced 
shop-floor personnel using simple dispatching rules in practice. Many heuristic methods 
have been presented and implemented based on due dates, criticality of operations, opera¬ 
tion processing times, and machine utilization (e.g., Blackstone et al 1982). Many artificial 
intelligence (Al) approaches also use heuristics for scheduling (e.g., Kuziak 1990). These 
heuristics-based approaches usually generate feasible schedules quickly but it is very dif¬ 
ficult to evaluate the quality of the schedules. Also, most heuristics do not provide for 
iterative improvement of the schedules. 

Attempts to bridge the gap between heuristic and optimization approaches have also been 
undertaken (Adam et al 1988; Luh & Hoitomt 1993; Ventura & Weng 1995). In Adams 
et al (1988), for example, a heuristic for job shop scheduling was developed based upon 
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:he solution was fed back into the previously solved machine problem by a “local 
imization.” However, schedule evaluation could only be achieved through “selective 
leration.” Also, each operation has to be pre-assigned to a specific machine before 
luling, though the operation may be processed on different machines or different types 
achines. 

Luh Sl Hoitomt (1993), a Lagrangian relaxation framework was established for manu¬ 
ring system scheduling problems, and a practical method was provided. In the method, 
machine capacity and operation precedence constraints are relaxed by using Lagrange 
ipliers, and operation-level subproblems are formed and solved by enumeration. The 
ipliers are then updated at the high level by using a subgradient method. An improved 
on of the method considering bills of materials, with a modified subgradient method 
e high level was presented in Czerwinski & Luh (1994). 

uch progress has been made on the scope and performance of the LR-based method- 
y. A combined LR and heuristic method was developed for job shop scheduling with 
p-dependent setups and finite buffers in Luh et al (1995). The scheduling of batch 
lines with setup requirements was addressed in Luh et al (1997b). A “forwai'd” dy- 
ic programming (FDP) algorithm was embedded within the LR framework for job 
I scheduling in Chen et al (1995). In the method, only machine capacity constraints 
elaxed, and part level subproblems are formed and solved by using the FDR By doing 
the solution oscillation difficulties as reported in Czerwinski & Luh (1994) are alle- 
;d. Also, by relaxing less constraints, the dual cost should be a tighter lower bound, 
uh et al (1997), an LR-based method was developed for job shop scheduling with 
^rtain parts. A “backward” dynamic programming (BDP) was developed to solve the 
subproblems with random part parameters. 

3r high level algorithms, the slow convergence of subgradient methods was analysed, 
the facet ascending algorithm (FAA) was presented to improve the convergence in 
lastik & Luh (1993). The reduced-complexity bundle method (RCBM) was developed 
omastik & Luh (1996) which significantly reduces the computation complexity but 
itains the convergence of the conventional bundle methods. An interleaved subgradient 
lod (ISG) was developed in Kaskavelis & Caramanis (1995) to improve the efficiency 
le LR-based method. A review of those methods will be presented in § 3.3. 


Overview of the paper 

' 2, an integer optimization formulation with a “separable” structure for job shop 
iduling is presented. In § 3, the problem is decomposed into individual part subprob- 
b by relaxing machine capacity constraints following the approach in Chen etal (1995). 
w selected methods for solving subproblems and for updating multipliers are reviewed. 
td on the insights obtained, a new algorithm is presented that combines BDP for solv- 
iow level subproblems and an interleaved conjugate gradient (ICG) method for solving 
high level problem. In § 4, numerical results show that the new method outperforms a 
dous LR/SG method in convergence. Numerical testing for practical data sets shows 
the method can generate high quality schedules in a timely fashion. 


2. Problem formulation 


In a job shop, machines may have different processing capabilities. Machines with the 
same processing capability are grouped as a “machine type,” and all the machine type 
form a set denoted as H. The total number of machine types is thus \H\. There are 7 f 
parts with various due dates Di to be scheduled over a discretized time horizon K. Part | 
i {i = 0, 1,1) consists of nonpreemptive Jj serial operations with operation j ^ 

(j = 0,1,..., Ji - 1) of part i denoted by (i, j). An operation may-start only after its 
preceding operation has been completed, and requires a machine belonging to a given set | 
of eligible machine types Hij for a specified duration of time. Without loss of generality, ’ 
it is assumed that operations of part i are performed in the ascending order of operation 
index j. 

The time horizon consists of K time units, indexed by k (k = 0, I,..., K — 1). Each 
operation beginning time is defined as the beginning of the coiTesponding time unit, and 
each completion time the end of the time unit. The variables used in the problem formulation ^ 
are listed below. •• - 


^ijhk- 0-1 operation variable which is one if operation (/, j) is performed on machine 
type h at time k, and zero other. 

Pi : Part earliness weight. 

bij: Beginning time of operation (i, j). 

Cij : Completion time of operation (i, J). 

Di ; Due date of part i . 

E,; Earliness of part i, defined as E/ = max[0. Si - bio]- 
h: Machine type variable, h e H. 

Hij\ Set of machine types capable of performing operation (/, j). 

J: Objective function to be minimized. 

k\ Time index {k = 0, i,..., K — 1). 

Milk'. Capacity of machine type h at time k. 

Pijh'. Processing time of operation (f, j) on machine type h e Hij. 

Si : Desired raw material release time for part i . 

Tf. Tardiness of part i, defined as D = max[0, Ci^j.-i — Di]. 

Wi: Part tardiness weight. 


[ 

Assuming that the set of machine types, the number of parts, part due dates and weights, | 
operation processing time and time horizon are given, the constraints and objective function 
are explained below. 

2.1 Machine capacity constraints ’ 


In the literature (e.g.. Baker 1974; Adams 1988), the limited machine capacity is often ^ 
modelled by “disjunctive constraints.” Given a pair of operations (denoted as A and J?) to f 
be performed on a particular machine, the disjunctive consti'aints state that either B starts j 
after the completion of A or A starts after the completion of B. As a result, the number 1 
of disjunctive constraints increase drastically with the number of operations. Also, it is 
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•ed that each operation must be pre-assigned to specific machines, though an operation 
)e processed on different machines or different types of machines, 
defining a set of 0-1 operation variables Sijhk to represent the processing status of 
operation, the following machine capacity constraints are formed. The constraints 
hat the total number of operations being performed (active) on machine type h must 
,s than or equal to the capacity (Mhk) of machine type h at any time unit k, i.e., 

/-I Ji-l 

h€H- k = 0 ,...,K-\. ( 1 ) 

1=0 j =0 

j the 0-1 operation variable Sijhk is defined by 



if bij <k < Cij, 
otherwise. 


( 2 ) 


e number of machine capacity constraints equals the number of machine types times 
ne horizon K. Although the number of 0-1 operation variables is huge, these variables 
stermined once the machine types and beginning times of the operations are speci- 
They thus are not independent decision variables, and do not cause any complexity 
ulty. 


Operation precedence constraints 

)peration precedence constraints are represented by the following “conjunctive con- 
its.” These constraints state that an operation cannot be started until its preceding 
ition is finished, i.e., 

Cij-iAl<bij, i = 0 , 1 ,...,/- 1 ; 7 = 1 , 2 ,...,//-!. ( 3 ) 

ice operation (i, 7 — 1) is completed at the end of time unit Cij-\, and operation (/, 7) 
1 at the beginning of time unit btj , the term “ 1 ” is required in ( 3 ). For the same reason, 
jrm “1” also appears in the following processing time requirements. 

Processing time requirements 


processing time requirements state that each operation must be assigned the required 
int of time for processing on the selected machine type /z, i.e., 

Cij = bij + Pijh - 1, z = 0, 1,...,/- 1; 7 = 0, 1,...,//- 1; h € Hij. 

( 4 ) 

ith processing times specified, operation completion times Cij can be eliminated from 
iroblem formulation. For notational convenience, they still appear in later derivation. 


Objective function 

lus objective functions such as makespan have been used in the literature. Research 


ubciui uiau, sciy, luaRcspau uuLcna (.DiacKSLune ei ai lyoz;. in aaaiiion, me aaaiiiviiy oi 
the tardiness objective function facilitates the decomposition approach. 

Besides the on-time delivery, working-in-process (WIP) inventory is another major 
concern in practice. To reduce WIP inventory, a desired raw material release time Si for 
each part is derived based on part due date and total processing time of the part (sum of 
operation processing times of the part). An earliness term for each part is added to the 
tardiness objective function, representing the penalty for releasing raw material too early. 
The requirement for on-time delivery and low WIP inventory is thus modelled as a goal to 
minimize the weighted part tardiness and earliness penalties, i.e., 

/-I 

Jsy^(WiT2 + A£?). (5) 

i=0 

The square on tardiness reflect the fact that a part becomes more critical with each time 
unit after passing its due date. Similarly, square is applied to each earliness penalty teim. 
The objective function accounts for the priorities of the parts, the importance of meeting 
due dates and desired release times. 

The overall problem therefore is to minimize the part tardiness and earliness penalty 
function, subject to the above machine capacity and operation precedence constraints, i.e., 

7-1 

min J, withjs + ( 6 ) 

subject to 

7-1 Ji-\ 

AeH; k = 0 . K-l, (7) 

i=0 J=0 

Cij-I + 1 < bij, / = 0, 1,...,/- 1; ; 1,2,..., 7/ - 1. (8) 

The decision variables are the operation beginning times bij and the machine types hij 
for individual operations. Once bij and hij are selected, [cij], {7j }, {£■/}, and {Sjjhk] can 
be easily derived. 

3. Solution methodology 

3.1 Lagrangian relaxation 

Lagrangian relaxation (LR) is a mathematical programming technique for performing 
constrained optimization. Similar to pricing concept of a market economy, the Lagrangian 
relaxation method replaces “hard” coupling constraints (e.g., machine capacity constraints) 
by the payment of certain “prices” (i.e., Lagrange multipliers) for the use of machines 
at individual time units. The original NP-hard problem can thus be decomposed into 
many smaller and easier subproblems. The solutions of individual subproblems, when 
put together, may not constitute a feasible schedule since coupling constraints have been 
relaxed by the multipliers. These prices or multipliers are thus iteratively adjusted based 
on the degree of constraint violations following again the market economy mechanism. 
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)biems are then re-solved based on the new set of multipliers. In mathematical terms, 
I function” is maximized in this multiplier updating process, and values of the dual 
)n serve as lower bounds to the optimal feasible cost. At the termination of this 
lier updating process, simple heuristics are used to adjust subproblem solutions to 
e a feasible schedule satisfying all constraints. Heuristics can also be run after each 
zation iteration to check convergence or to provide candidate feasible schedules, 
ization and heuristics thus operate in a synergistic fashion to generate effective 
lies. The quality of the schedule can also be quantitatively evaluated by comparing 
t to the largest lower bound provided by the dual function. 

jsing Lagrange multipliers nhk to relax machine capacity constraints, the following 
i problem is obtained, 

’.d problem 


min L, 

{bijMj} 


with L = + A'E?) + E E ^'■.7 


id k=bij 


~ ^ ^hk^hki 
h,k 


(9) 


t to the operation precedence constraints (3), 

leriving (9), the fact Hk^Io ^hkkjhk = ^hijk is used, and the relaxed 

:m has bij and hij as its decision variables. After regrouping terms related to indi¬ 
parts, the relaxed problem can be decomposed into the following part subproblems. 


ubproblems 


Ji-l cij 

min Li , with t,- = Wi 3"? + ft eJ + Y. E ^'■0*’ ('0) 

;=0 k=bij 

It to the corresponding operation precedence constraints for part j. 
m (10), a part subproblem reflects the needs to balance tardiness penalty, earliness 
;y, and machine utilization costs. This part subproblem can be viewed as a multi-stage 
ization problem with each stage corresponding to an operation. Although solving the 
al problem by using dynamic programming (DP) is impractical, the decomposed part 
Dblem is not NP-hard, and can be efficiently solved by using DP as will be presented 
, 2 . 

L* denote the minimal subproblem cost of part i with given multipliers, the high 
Lagrangian dual problem is then obtained as below. 

problem 

maxD, withD = ^L;-^MM7rM. (11) 

\^hki : iTi. 


The Lagrangian dual function D is concave (Bertsekas 1995), and piece-wise linear, 
and consists of many “facets” (Tomastik & Luh 1993). We next present the resolution of 
part subproblems, followed by the updating of Lagrange multipliers. 

3.2 Dynamic programming 


The forward dynamic programming (FDP) algorithm presented by Chen et al (1995) can 
be used to solve a part subproblem in (10). It starts with the first operation of the part, 
and precedes to the last operation. In this paper, a backward dynamic programming (BDP) 
is developed with the goal to be further extended to handle uncertainties (e.g., uncertain 
arrival times, processing times, due dates etc., see Luh et al 1997). The BDP algorithm 
starts with the last stage, and compute the costs of the last operation (i, Ji — 1) for all 
possible bi^ _ i and /zy,. _ i: 




Vi,J ;-1 (bij ._ 1, /ij,y, _ 1) = Wi + Y. 

k=bir^ 


( 12 ) 


For other operations (/, j) , the cumulative costs are obtained by recursively solving the 
following DP equation subject to operation precedence constraints (3): 


Vij(bij, hij) — 


mm 


^l,y + l 72/, j+1 


^h{jk + Vij^\{bij^], hij^i) 

k—bi i 


==AijPiEf-\- Y^hijk + ^ min Vi,j+i(bij+u 

k=bij bij+i,hij+i 


(13) 


In the above, Aij is 1 if (/, j) is the first operation (j = 0) and 0 otherwise. The function 
Vij(bjj,hij) is the cumulative cost for all operations succeeding and including (/, j), and 
AijfiiEf 4- YkLi,.. Tthijk are the “stage-wise” costs. The algorithm starts from the last 
stage and moves backwards till the first stage is reached. The optimal subproblem cost 
Lf is then obtained as the minimal cumulative cost at the first stage. Finally, the optimal 
beginning times bij and machine types selected hij for operations can be obtained by 
tracing forwards the stages. Similar to FDP, the computation complexity of the above BDP 
algorithm is 0(K E/Li 1^/yl) (Luh etal 1997). 


3.3 Solving dual problem 


3.3a Subgradient methods: As mentioned above, the Lagrangian dual function is con¬ 
cave and piece-wise linear. Existing methods for optimizing the dual function fall roughly 
into three classes: subgradient, cutting plane, and bundle methods. Of these, subgradient 
methods are commonly used to update the Lagrange multipliers (i.e., to maximize the 
dual function) because of their simplicity, the speed for computing a direction, and the 
global convergence property. With the subproblem solutions for given multipliers izhk, the 
subgradient g of the dual function D is^ calculated by 
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7-1 Ji-l 

ghk = '£.Yl^‘m-'^hk, heH; k = Q,...,K-\, 

i=Q j=0 


(14) 


where ghk is an element of the subgradient. In subgradient methods, multipliers are updated 
along the direction of the subgradient with the step size determined by 


a 


n 


D* - /)" 


0 < y <2 


(15) 


where D* is the optimal dual cost, and a;”, D" and g" are respectively the step size, dual 
cost and subgradient at iteration n. As shown in Tomastik & Luh (1993), subgradient 
methods often zigzag across a ridge (intersection of some facets) of the dual function. The 
slow convergence rate (less than linear) of the subgradient methods causes these methods 
to require many iterations to reach an optimum. 


3.3b Facet ascending algorithm: By recognizing that the Lagrangian dual function is 
polyhedral concave, and is made up of many facets, the facet ascending algorithm (FAA) 
finds the intersection of adjacent facets (Tomastik & Luh 1993). A subgradient of one of the 
facets is then projected on the intersection to obtain an ascending direction, and a line search 
technique is used to determine how far to move along the direction. The FAA avoids the 
zigzagging behaviour of subgradient methods, and shows improved convergence. For large 
problems, an intersection is usually formed by many facets. Finding such an intersection 
is often difficult and requires many dual function evaluations which are “computationally 
expensive.” Furthermore, the ridges are short, causing slow convergence. 


3.3c Bundle methods: The bundle method (e.g., Hiriart-Urruty & Lemarechal 1993) 
has the fastest convergence rate among the three classes of methods. It accumulates and 
utilizes the subgradients of points within a neighbourhood of the current iterate to find an 
6 -ascent direction (along which the function value can increase at least by 6), or to detect 
within 6 of the dual optimum (e-optimal). Finding such a direction or detecting e-optimal, 
however, requires solving a number of quadratic programming problems with considerable 
complexity. To reduce the complexity while maintaining the convergence of the bundle 
method, the reduced-complexity bundle method (RCBM) finds an 6-ascent direction by 
performing a projection of a subgradient onto an appropriate subspace formed by the 
subgradients in the bundle (Tomastik & Luh 1996). Along the 6-ascent direction, a line 
search technique is then used to determine the step size for updating multipliers. Similar to 
FAA, the large number of dual function evaluations required to accumulate the subgradients 
and to perform line search is very time consuming, and hinders the applicability of the 
RCBM to very large problems. 

3.3d Interleaved subgradient method: The iterative resolution of the dual problem re¬ 
quires the dual function to be evaluated many times, and each function evaluation involves 
solving all the subproblems once (called one iteration). These dual function evaluations are 
extremely “expensive” for large problems. For example, it takes about 68% of total CPU 
time for a case with 82 parts and 14 machines. To efficiently utilize the expensive function 


evaluations, an interleaved subgradient (ISG) method has been developed (Kaskavelis & 
Caramanis 1995). Instead of solving all subproblems before updating multipliers, the ISG 
method updates multipliers after solving each subproblem. At the high level, the multipli¬ 
ers are updated along the direction of the subgradient. Numerical results show that the ISG 
method converges much faster than a subgradient method, though algorithm convergence 
has not yet been established. 


3.3e Interleaved conjugate gradient method: As mentioned earlier, the dual function 
is concave, piece-wise linear, and consists of many facets. Each possible solution of the 
relaxed problem corresponds to a facet. Because of the combinatorial nature of the orig¬ 
inal problem, the number of possible solutions of the relaxed problem and therefore the 
number of facets increases drastically as the problem size increases. The dual function 
thus approaches a smooth function. This “smoothness” of the dual function motivates the 
use of optimization methods for smooth functions. 

Among the methods for optimizing smooth functions, conjugate gradient methods have 
attractive convergence properties and computation efficiency. The conjugate directions are 
generated by 




(g")^ g" 


n = 1,2,- 

(16) 


where d’^ and g" are the conjugate direction and gradient at iteration n. The step size 
for updating multipliers is determined by performing a line search along the conjugate 
direction. 

By incorporating the “interleave” concept with the conjugate gradient method, an in¬ 
terleaved conjugate gradient (ICG) method has been developed that utilizes the “smooth” 
property of the dual function and efficiency of the interleaved method for problems of large 
sizes (e.g., 1000 multiphers or more). In this paper, the ICG method is used to update the 
multipliers. The ICG algorithm is summarized as follows. 

50 Given the initial multipliers, solve all the part subproblems, and compute the dual 
cost and subgradient. Update multipliers along the direction of the subgradient. Set 
subproblem index .y = 1. 

51 Solve subproblem s while keeping other subproblem solutions unchanged. Compute 
the “surrogate” dual cost and subgradient according to (11) and (14) with the latest 
available subproblem solutions. 

52 Compute conjugate direction by (16) with the surrogate subgradient, and update multi¬ 
pliers along the direction. Since only one subproblem is solved for a set of multipliers, 
line search cannot be used to determined the step size. The step size is therefore still 
computed according to (15) with D* replaced by the lowest feasible cost obtained up 
to the current iteration. 

53 Increase 5 by one. If j is larger than the total number of subproblems, reset ^ to 1. Go 
to SI. 
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3.4 Constructing feasible schedule 

The solutions to part subproblems, when put together, are generally associated with an 
infeasible schedule, i.e., capacity constraints might be violated at some time periods. A 
feasible schedule is constructed by using a list scheduling heuristic. In the list scheduling 
procedure, a list of immediately performable operations is created, and maintained in the 
ascending order of their beginning times from part subproblem solutions. Operations are 
then scheduled on the required machine types according to this list as machines become 
available. If the capacity constraint for a particular machine type is violated at time k, a 
greedy heuristic determines which operations should begin at that time and which ones 
are to be delayed by one time unit. The subsequent operations of those delayed ones are 
then delayed by one time unit if precedence constraints are violated. The process repeats 
until the last operation in the list. 

The cost of the feasible schedule J is an upper bound on the optimal cost J*. The optimal 
dual value D*, on the other hand, is a lower bound on J*. Since it is usually difficult to find 
J* and D*, the (relative) duality gap(J — D)/D is often used as a measure of the quality 
of the feasible schedule. 

4. Numerical results 

The new method that combines BDP and ICG within the LR framework has been imple¬ 
mented using the object-oriented programming language C + H-, and extensive testing has 
been performed. Four test cases ai'e presented below. The first two cases are to demon¬ 
strate that the new LR/BDP/ICG method has better convergence than the previous LR/SG 
method with both machine capacity and operation precedence constraints relaxed and with 
a subgradient method at the high level. Case 2 also shows that the LR/BDP/ICG method 
generates a better schedule than a heuristic method that combines the “first come first 
serve (FCFS)” and “shortest processing time (SPT)” rules (called FCFS/SPT). The next 
two cases demonstrate that the LR/BDP/ICG method is applicable for solving scheduling 
problems of realistic sizes. In presenting the results for both LR/SG and LR/BDP/ICG 
methods, an iteration corresponds to solving all subproblems once. 

The four cases are tested on a Sun Sparc 10 workstation. In the testing, all multipliers 
are initialized at zero. The time horizons are automatically generated based on machine 
availabilities and part processing requirements. The step size factor y in the LR/BDP/ICG 
method is initialized to a specific value, and adaptively adjusted based on information 
obtained in the iterative process. 


Case 1: This test case is to demonstrate that the LR/BDP/ICG method generates a tighter 
lower bound than the LR/SG method. There are two machines of different types and two 
parts. Part one has two operations with processing times 3 and 2, respectively, and part two 
also has two operations with processing times 1 and 4, respectively. The first operations of 
both parts require machine type 0, and the second operations require machine type 1. The 
due dates are zero. The tardiness penalty weights are 1, and there is no earliness penalty. 
The results are summarized in table 1. 


/LJl. 
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Table 1. Testing results for case 1 (y = 0.5). 


Method 

Iteration 

Dual/J 

Duality gap 

CPU (s) 

LR/SG 

2000 

44/52 

18% 

3.00 

LR/BDP/ICG 

17 

52/52 

0% 

0.01 


It can be seen that botli methods generate optimal schedule with the cost J = 52 
which equals to the lower bound obtained by LR/BDP/ICG. The dual cost by using 
LR/BDP/ICG is thus optimal. The dual cost obtained by LR/SG (= 44) is also optimal 
for the corresponding dual function, as verified by using a LR/RCBM method. This test 
case therefore shows that by using the above LR/SG method, there exists an inherent gap 
(denned as the gap between the optimal dual cost and optimal feasible cost). It also shows 
that a tighter bound is obtained when the operation precedence constraints are not relaxed. 


Case 2: This test case is to demonstrate that the LR/BDP/ICG method outperforms the 
LR/SG method and the FCFS/SPT method. In this case, there are three machine types 
with one machine each, and four parts with a total of twelve operations. For all the parts, 
the due date and weight are -1 and 5 respectively, and there is no part earliness penalty. 
Operation processing times and required machine types are listed in table 2a, and the time 
horizon is 30. Testing results are summarized in tables 2b and 2c. 

Since all the weights are integer, the cost of a feasible schedule should also be an 
integer. With the lower bound 2374.7 and feasible cost 2375 obtained by LR/BDP/ICG, 
the schedule obtained must thus be optimal. It can also be seen from table 2b that the 
LR/BDP/ICG method significantly speeds up convergence. 

This case is also tested by using the FCFS/SPT method. In the FCFS/SPT method, 
operations are performed according to the FCFS rule. When several operations come to 
a machine at the same time, the SPT rule is used to determine the sequence of the these 
operations. The cost of the resulting schedule is 2852, which is significantly higher than 
the optimal feasible cost (= 2375). The schedule obtained by FCFS/SPT as well as the 
optimal schedule is presented in table 2c. 

The following two cases draw data from industries, and are tested by using the LR/BDP/ 
ICG method. The purpose is to demonstrate that the method is applicable for solving 
practical scheduling problems. A few performance metrics as well as the feasible costs 
at some iterations are evaluated to measure the schedule quality. The metrics are defined 
below. 

Makespan: the duration of time for processing all the parts. 

Maximum work-in-process inventory: the maximum number of parts in processing at 
a time unit. 

Average work-in-process inventory: the average number of parts in processing over the 
makespan. 

Average lead time: the average elapse time between part beginning and completion 
times for all parts. 
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Table 2a. Input data for case 2 (y = 0.5). 


Part 


(Processing time/Machine type) 


Operation 0 

Operation 1 

Operation 2 

Part 0 

4/0 

3/1 

2/2 

Part 1 

1/1 

4/0 

4/2 

Part 2 

3/2 

2/1 

3/0 

Part 3 

3/1 

3/2 

1/0 


Table 2b. Testing results for case 2. 

Method 

Iteration 

Dual cost 

J 

Duality gap 

CPU (s) 

LR/SG 

2000 

2179 

2375 

9.0% 

4.0 

LR/BDP/ICG 

100 

2374.7 

2375 

0.1% 

0.1 


Table 2c. Feasible schedules for case 2. 

O', j) 

h 


LR/BDP/ICG 

FCFS/SPT 

bij 

Cij 

bij 

Cij 

(0,0) 

0 

0 

3 

0 

3 

(0,1) 

1 

4 

6 

6 

8 

(0. 2) 

2 

7 

8 

12 

13 

(1,0) 

1 

3 

3 

0 

0 

(1,1) 

0 

4 

7 

4 

7 

(1.2) 

2 

9 

12 

8 

11 

(2,0) 

2 

0 

2 

0 

2 

(2, 1) 

1 

7 

8 

4 

5 

(2, 2) 

0 

9 

11 

9 

11 

(3,0) 

1 

0 

2 

1 

3 

(3, 1) 

2 

3 

5 

4 

6 

(3,2) 

0 

8 

8 

8 

8 


Case 3: This case is to show that the method selects machine types for individual oper¬ 
ations as well as their beginning times. In this case, there are eleven machine types with a 
total of 16 machines and 18 parts with various due dates and weights. A part may have up 
to 17 operations, and the total number of operations is 159. An operation may be performed 
on one of up to four different machine types. With a time horizon 1086, the testing results 
are summarized in table 3a, and the performance metrics in table 3b. 

It can be seen that both the lower bound and feasible schedule keep on improving as the 
number of iterations increases. It is also shown in table 3b that the schedule obtained at 
iteration 400 has better performance than the schedule at iteration 1. 


Case 4: This case is to demonstrate the capability of the LR/BDP/ICG method for 
scheduling problems with large sizes. In this case, there are eight machine types with a 
total of 14 machines. Each machine type has one or two identical machines. A total of 82 


Iteration Dual cost J Duality gap CPU (s) 


1 19339 

200 37394 

400 37456 

86988 

40715 

40170 

350% 

8.9% 

7.2% 

0.7 

128 

261 

Table 3b. Performance metrics. 

Iteration 

1 

200 

400 

Makespan 

782 

544 

546 

Maximum work-in-process inventory 

18 

16 

16 

Average work-in-process inventory 

8.16 

8.20 

8.09 

Average lead time 

354 

247 

245 


operation can be performed on a specific machine type. Testing results are summarized in 
table 4a, and the performance metrics in table 4b. 

The results in tables 4a and 4b show the iterative improvement of the LR/BDP/ICG 
method on dual costs, feasible costs and performance metrics. 

From the results of cases 3 and 4, it can be seen that significant improvement on the 
schedule quality and lower bound is obtained in the first 200 iterations. The improvement 
slows down from iteration 200 to 400. It is thus not needed to run the algorithm for a long 
time to get high quality schedules. 

As mentioned above, the computation complexity of the BDP algorithm for solving part 
subproblems is 0(K i I Htj j). It is observed from testing that the resolution of subprob¬ 

lems takes most of the CPU time in LR/BDP/ICG. The complexity of the LR/BDP/ICG 
algorithm is thus dominated by BDP. As shown in table 5, the computation time for 200 
iterations increases almost linearly with K l^ul- 

Lagrange multipliers reflect the “price” information for using machines. In day- 
to-day scheduling, the multipliers associated with the previous schedule can initialize 
the algorithm to generate a new schedule. Since the production in a job shop may not 
change much from day-to-day, the computation time for finding a good schedule will 


Table 4a. 

Testing results for case 4 {y 

= 0.1). 



Iteration 

Dual 

J 

Duality gap 

CPU (s) 

1 

84.5 

46487 

54914% 

3 


28837 

38843 

34.7% 

542 


29921 

37161 


1086 


Table 4b. Performance metrics of feasible schedule. 

Iteration 

1 

200 

400 

Makespan 

1049 

1050 

1047 

Maximum work-in-process inventory 

42 

30 

32 

Average work-in-process inventory 

22.3 

15.4 

16.8 

Average lead time 

322 

219 

240 







An optimization-based algorithm for job shop scheduling 


255 


Table 5. Computation time analysis for cases 3 and 4. 


Case 


K 


CPU (s)/200 iter. 

3 

269 

1086 

292 

128 

4 

752 (269 X 2.8) 

2068 (1086 X 1.9) 

1555 (292 X 5.3) 

542 (128 X 4.2) 


decrease vastly (roughly by 2/3 according to testing experience) with initialization 
procedure. 


5. Conclusions 

In this paper, near-optimal solution methodologies for job shop scheduling are examined, 
and many insights are provided on a few selected methods for solving subproblems and 
for updating multipliers. A new algorithm is presented that combines backward dynamic 
programming for solving low level subproblems and interleaved conjugate gradient method 
for solving the high level problem. The new method significantly improves algorithm 
convergence and solution quality. Numerical testing for practical data sets shows that the 
LR/BDP/ICG method can generate high quality schedules in a timely fashion, and it is 
practical for job shop scheduling in industries. 
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Abstract. Effective and efficient scheduling in a dynamically changing en¬ 
vironment is important for real-time control of manufacturing, computer, and 
telecommunication systems. This paper illustrates the algorithmic and analy¬ 
tical issues associated with developing efficient and effective methods to up¬ 
date schedules on-line. We consider the problem of dynamically scheduling 
precedence-constrained jobs on a single processor to minimize the maximum 
completion time penalty. We first develop an efficient technique to reoptimize 
a rolling schedule when new jobs arrive. The effectiveness of reoptimizing the 
current schedule as a long-term on-line strategy is measured by bounding its 
performance relative to oracles that have perfect information about future job 
arrivals. 

Keywords. Scheduling; design and analysis of algorithms; heuristics. 


1. Introduction 

Planning and scheduling dynamic systems with random job arrivals, failures, and pre¬ 
emption are very challenging tasks. Typically, since future events cannot be forecast with 
enough detail and accuracy, planners often use on-line scheduling strategies. Consider, 
for instance, a production system with random job arrivals. On-line methods apply when 
detailed information regarding a job’s processing requirement is revealed only at its release 
time. Thus, each release time represents an epoch at which the existing schedule is revised 
to reflect the new information. One on-line scheduling strategy consists of reoptimizing 
the current “rolling” schedule at each job arrival epoch using a deterministic scheduling al¬ 
gorithm that only uses information about the current system and workload status. We refer 
to this scheduling strategy as on-line reoptimization. This strategy of reacting to changes 
in system status (job arrival or completion, processor failure etc.) by reoptimizing and 
updating the current schedule raises two issues. 
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First, given an optimal schedule of n tasks, can we devise an efficient method to revise 
this schedule, say, when a new task enters the system? Intuitively, since the existing n-job 
schedule contains useful information, exploiting this information to adjust the schedule is 
likely to be more efficient compared to reconstructing the optimal {n + l)-job schedule 
from scratch. We refer to the latter method as a zero-base algorithm, while a method that 
exploits current schedule information is an updating algorithm. Computer scientists have 
emphasized this issue of relative efficiency of updating methods in the context of certain 
geometric and graph problems by developing specialized data structures and updating al¬ 
gorithms (see, for example, Spira & Pan 1975, Chin & Houck 1978, Even & Shiloach 
1981, Overmars & van Leeuwen 1981, Frederickson & Srinivas 1984, and Frederickson 
1985). In conti'ast, the research on deterministic resource scheduling (see, for example, 
Graham et al 1979) focuses primarily on zero-base algorithms. In this paper, we illus¬ 
trate the algorithmic issues in dynamic reoptimization by developing an efficient updating 
method for one class of single-machine scheduling problems. 

Efficient schedule updating methods are especially important for real-time planning and 
control Consider, for instance, the following “bidding” scheme for assigning tasks in a 
distributed processing system (see, for example, Ramamritham & Stankovic 1984, Zhao & 
Ramamritham 1985, Malone et al 1988). Jobs with varying processing requirements and 
due dates arrive randomly at different processor locations. Each processor maintains and 
updates its own local schedule. When a new job enters the system (or when a processor 
fails), the source node (or a central coordinator) queries the other processors to determine 
their expected completion time before deciding where to dispatch the job. To formulate 
its response, each target processor must adjust its current schedule to accomodate the new 
job and determine its tentative completion time. Subsequently, when the job is awarded 
to a processor, the selected processor must again update its schedule. Given the possi¬ 
bly large volume of job announcements and reassignments, devising efficient updating 
algorithms to accomodate new jobs is clearly critical for this type of real-time control 
mechanism. 

In addition to updating efficiency, we are also interested in the effectiveness of the 
on-line reoptimization strategy. In particular, what is the relative performance (i.e., close¬ 
ness to optimality) of schedules obtained through on-line reoptimization compared to 
an “optimal” off-line decision procedure that has perfect information about the future? 
Recently, computer scientists have developed a standardized approach to study this perfor¬ 
mance characteristic of on-line methods. The approach seeks a worst-case measure called 
competitiveness to evaluate solution effectiveness. We illustrate this mode of analysing 
effectiveness using our single machine scheduling example. 

Studying efficiency and effectiveness issues for on-line reoptimization required a judi¬ 
cious choice of the scheduling context. In particular, both the structure and performance 
of on-hne updating methods depend strongly on the scheduling objective. Consider, for 
instance, the problem of scheduling a single machine to minimize maximum tardiness 
for unrelated jobs with identical release times. The earliest due-date rule finds the optimal 
schedule for this problem (Jackson 1955). Given an earliest due-date (EDD) schedule for n 
jobs, the updating problem consists of constructing a new EDD schedule (by adjusting the 
current schedule) to accomodate a new job. If n denotes the number of currently scheduled 
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jobs, re-sorting the {n -|- 1) jobs to construct the new EDD schedule requires 0{n logn) 
operations. However, updating can be performed much more efficiently if we use a heap 
structure (see, for example, Tarjan 1983 and Aho et al 1974) to store the current schedule. 
Updating the heap when a new job arrives merely involves inserting the new item and 
rebalancing the heap, which requires OQogn) effort. For other scheduling objectives, the 
updating method is not so obvious, and the n-fold computational improvement may not 
be possible. And, of course, for NP-hard scheduling problems (e.g., minimizing sum of 
completion times for jobs with ai'bitrary release dates) the updating problem is not likely 
to be polynomially solvable either. 

In this paper we develop and analyse the worst-case performance of an on-line updat¬ 
ing method for a single machine-scheduling problem with precedence-constrained jobs, 
where the objective consists of minimizing the maximum completion time penalty over 
all jobs. In the classification scheme proposed by Graham et al (1979), we consider the 
l/prec//max problem. Lawler (1973) proposed an Oin^) zero-base algorithm to con¬ 
struct an optimal w-job schedule for this problem. Subsequently, Baker et al (1983) gen¬ 
eralized this algorithm to the case where jobs have arbitrary but known release dates, 
and preemption is permitted. For the 1 /prec/Zmax problem, we focus on a special class 
of penalty functions that satisfy a consistency property defined in § 2. Several penalty 
functions such as linear completion time and tardiness penalties satisfy this property. 
For this class of scheduling problems, § 3 first describes a new zero-base algorithm 
called the Forward algorithm (unlike Lawler’s algorithm, this method schedules jobs from 
front to back) with 0(m -f- nlogn) worst-case time-complexity, where m denotes the 
number of arcs in the precedence graph. Subsequently, we develop an updating ver¬ 
sion of the Forward algorithm that uses information about the current n-job optimal 
schedule to optimally add a new job. If the new job has n'{< n) ancestors, and the 
precedence subgraph induced by these ancestors has m'{< m) arcs, the computational 
complexity of the Forward updating algorithm is 0{m' -f n'). Results of computer sim¬ 
ulations reported in § 4 confirm that, in practice, the Forward updating procedure re¬ 
quires significantly lower computational time than applying the zero-base algorithm to 
construct the (n -f l)-job optimal schedule. Section 5 analyses the competitiveness of 
on-line reoptimization for selected penalty structures. We show that the method is 2- 
competitive (i.e., its worst-case performance ratio relative to the optimal, perfect infor¬ 
mation schedule is bounded above by a factor of 2) for a delivery-time version of the 
scheduling problem (without preemption), and when the penalty function is subadditive 
(with preemption). 

This paper makes several specific contributions for the 1 /prec//max problem with consis¬ 
tent penalty functions. In particular, we: (i) propose a new Forward algorithm; (ii) develop 
an updating (on-line) version of the Forward algorithm; (iii) empirically demonstrate the 
computational benefits of using updating algorithms (instead of zero-base algorithms) to 
perform schedule adjustments; and, (iv) analyse the competitiveness of on-line reopti¬ 
mization for some special cases. However, our broader purpose is to use the 1/prec//max 
problem as an example to motivate the need for further work in the general area of ef¬ 
ficient and effective schedule updating methods, and to illustrate the issues that arise in 
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2. Problem description and notation 

The l/prec/Zmax problem consists of scheduling n jobs on a single machine, subject to 
precedence constraints on the jobs. Let pj denote the processing time required for job j. 
The job precedence constraints are specified via a directed, acyclic precedence graph G 
whose nodes correspond to the jobs; the graph contains a directed arc {i, j) from node 
i to node j if job i is an immediate predecessor of job j. For convenience, assume that 
jobs are indexed from 1 to n, with i < j if job i precedes job j. Let m denote the number 
of arcs in the precedence graph. We assume that the precedence graph is stored as a 
linked list requiring O (m) storage, with pointers from every job to each of its immediate 
predecessors. Let Bj denote the set of all immediate predecessors of job j. Job i is said 
to be an ancestor of job j if the precedence graph contains a directed path from i to j. 
Let Aj 2 Bj denote the set of all ancestors of job j. Each job j carries a non-decreasing 
penalty Junction fjitj) that depends on its completion time tj. The scheduling objective is 
to minimize /max = Max{/, (tj): j = 1, 2 ,... n}. 

Lawler (1973) developed the following 0(n^) zero-base algorithm to solve the 
l/prec//max problem. The method iteratively builds an optimal sequence by schedul¬ 
ing jobs in reverse order, i.e., it first identifies the job to be processed last (i.e., in position 
n of the schedule), then the job in position (n — 1), and so on. At stage k, let Qi^ denote 
the set of k currently unscheduled jobs, and let Tk denote the cumulative processing time 
for all jobs in Qk, i.e., Tk = T^ipj : j € Q*}. Also, let Rk be the subset of jobs in Qk 
whose successors, if any, have all been already scheduled; we refer to jobs in Rk as the set 
of eligible jobs at stage k. During stage k, the algorithm assigns to position k the eligible 
job j* € Rk with minimum penalty at time Tk, i.e., fj*{Tk) = Mm{ fj{Tk) : j e Rk}- The 
procedure terminates at the end of stage 1. Since each step requires 0{n) operations to 
identify the eligible job with minimum penalty, the overall complexity of the algorithm is 
O(n^). 

While Lawler’s algorithm applies to arbitrary penalty functions, we will focus on a 
special class of penalty functions that satisfy the following consistency condition: A set 
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Figure 2. Special consistent penalty functions. 


of functions /i(.), / 2 (.),. •., /«(•) is said to be consistent if, for every pair of indices 
i, j e {1,2,, n}, either fi(t) < fj{t) or fi{t) > fj(t) for all values of completion 
time t. 

As figure la shows, consistent functions do not intersect; Figure lb shows two penalty 
functions that are not consistent. Several natural penalty functions satisfy the consistency 
property. Examples, shown in figure 2, include (i) the weighted (linear and quadratic) 
completion time criteria fj{t) = wjt or fj(t) = wjt^ (ii) the lateness penalty fj(t) — t — 
dj , where dj is the due date for job j, and (iii) the tardiness penalty fj (t) — max{0, t — dj). 
Jobs with consistent penalty functions have the same relative ranking (say, increasing order 
of penalties) for all completion time values. Hence, we will sometimes omit the completion 
time argument, and denote as f > fj the fact that job i has a higher penalty than job j 
(for all completion time values). 

For convenience, we will assume that the penalty functions for the different jobs are 
distinct, i.e., either/i > fj or f < /y.Thus,thejob with the maximum penalty will always 
be unique. Our Forward algorithm requires only the relative order of jobs with respect to 
the penalty functions rather than the exact penalty values for different completion times. 
Hence, ranking the jobs in order of penalties is sufficient. 


j, rorwaru aigoriuuii loi i/picu/ jmax piuuiciii wiui pdiaii^ luucLiuuai 


This section first describes a new zero-base procedure called the Forward algorithm to find 
the optimal n-job schedule for the 1 /prec//max problem with consistent penalty functions. 
Unlike Lawler’s algorithm, the new method schedules jobs from front to back (i.e., it 
assigns jobs to earlier positions first). We prove the correctness of this algorithm, and 
demonstrate how it facilitates updating the schedule when a new job enters the system. 

3.1 The Forward zero-base algorithm 

The Forward algorithm is motivated by the following intuitive argument. Recall that, for 
consistent penalty functions, the relative ordering of jobs (in terms of their penalties) 
does not vary with time. Hence, if jobs are not constrained by precedence restrictions, we 
can minimize /max by scheduling the jobs in decreasing order of penalty. However, this 
decreasing-penalty order may violate some precedence constraints. To satisfy the prece¬ 
dence constraints, consider the following ‘natural’ scheme to selectively (and parsimo¬ 
niously) advance jobs: Start with the decreasing-penalty order as the candidate sequence; 
examine jobs from front to back in this sequence, and ensure precedence feasibility for 
each job j by advancing (i.e., scheduling immediately before job j) every ancestor that is 
currently scheduled after job j. This procedure effectively attempts to deviate as little as 
possible from the decreasing-penalty order by advancing only the essential low-penalty 
jobs that must precede the high-penalty jobs. As we demonstrate next, this principle forms 
the basis for the Forward algorithm, and gives the optimal n-job schedule. 

To describe and prove the validity of the Forward algorithm, we use some additional 
notation. For any subset of jobs 5, let Bj{S) be the set of all immediate predecessors of 
job j belonging to subset S, i.e., Bj{S) = Bj H S. Similarly, Aj{S) — Aj OS denotes the 
set of ancestors of job j in subset S. The Forward algorithm relies on the following result. 
Recall that we have indexed jobs such that i < j if job i precedes job j. 

PROPOSITION 1 

For any subset of jobs S, let j* be the job in this subset with the maximum penalty. 
Then, subset S has an optimal schedule, denoted as 11(5'), that assigns job j* to position 
{lAj*(5)l -t- 1], and all its ancestors to the first lAy’t=(S)| positions in increasing order of 
job indices (where |AI denotes the number of elements of the set A). 

Proof The first part of the proposition states that the subset S must have an .optimal 
schedule that processes the maximum-penalty job j* as soon as possible, i.e., this schedule 
first processes all ancestors of job j*, followed immediately by j*. We prove this result 
using an interchange argument. Consider an alternative optimal schedule H^' that does 
not satisfy this property. Let job j* be scheduled in position k > |Ay*(5)| -j- 1, and let 
job j' ^ Aj*(S) be a non-ancestor that is scheduled closest to, but before, job j*. Let 
k' denote the position of job / in schedule O", k' < k. By our choice of k', all jobs in 
positions (k' -h 1) to (k — 1) must be ancestors of j*. Also, job j' is not an ancestor for 
any of these jobs; otherwise, j' would be y*’s ancestor as well. Finally, since job j* has 
the maximum penalty.in the set S, fj* (t) > fj>(t), where t is the current completion time 


of job j*. Consider now the new schedule obtained by postponing job j' to position k, and 
advancing all jobs in positions A:'' + 1 to ^ by one position. By our previous observations, 
the new schedule must be feasible; furthermore, since job j* has a higher penalty than job 
J', the new schedule does not increase the maximum penalty of the schedule. By repeating 
this process until all non-ancestors of job j* are postponed beyond 7 *, we get an optimal 
schedule that satisfies the condition of the proposition. 

Now, job j* has the maximum penalty among all jobs in S. Thus, the penalty incurred for 
j* must exceed the penalty for each of its ancestors (since these are completed earlier and 
have lower penalty functions), regardless of their relative order in positions 1 to \Aj* {S) |. 
To be feasible, however, the assignment of these ancestors must satisfy the precedence 
constraints. Since jobs are numbered in order of their precedence, scheduling the ancestors 
in increasing index order gives a feasible schedule. □ 

Proposition 1 suggests the following iterative scheduling procedure: first, identify the 
job 7 * with maximum penalty, schedule it in position (|A 7 *( 5 ')| -f- 1 ), and assign all its 
predecessors to positions 1 through | Let S' Q S denote the remaining set of jobs 

(which are not ancestors of 7*). In the optimal schedule 11(5'), these remaining jobs must 
be scheduled optimally in positions (|A 7 *( 5 )| 2) to 15|. In effect, we can consider a 

new scheduling problem for the subset of jobs S', and apply proposition 1 to this new 
subset, and so on. Our method implements this iterative procedure. We formally describe 
the algorithm next. In this description, r is the iteration counter, Ir is the pointer to the 
last position in the schedule that is filled in the rth iteration, and Sr is the set of remaining 
unscheduled jobs at the beginning of iteration r. 


The Forward zero-base algorithm 

Step 0: Initializcition 
Set r <— 1; 

Sr -^{ 1 , 2 , 

Ir-l = 0 . 

Step 1: Iterative step 

(a) Find the job 7 *(r) with maximum penalty in set Sr', 

Aj*(r)(Sr) := Set of all ancestors of J*(r) in Sr. 

(b) Set Ir <— Ir-l + |Aj*(r)(5,-)l + L 
Assign 7 *(r) to position Ir 

(c) Assign jobs of the set Aj*(r)(Sr) to positions (Z^-i + 1) through 
/r — 1 in increasing order of job indices. 

(d) Set 5,+i 4 - 5, - Aj^r)(Sr) - {J*(r)} 

(e) If 5^+1 is empty, Stop. The current schedule is optimal; 

Else, set r ^ r -f-1, and return to step 1(a). 

Observe that the iterative step is performed at the most n times. We refer to the job 7 * (r) 
with the maximum penalty at the rth step as the rth bottleneck job. As r increases, the 
corresponding bottleneck jobs have successively lower penalties. 


iteration counter 

..., n}; set of unscheduled jobs at iteration r 

last position scheduled in previous iteration 
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Ranking of jobs in order <; i a 7 4 

of decreasing penalties : > » ’ » ^ 

n = number of jobs = 6 
m = number of arcs in precedence graph = 7 

Figure 3. Example for Forward zero-base algorithm. 


3.1a Example: To illustrate the Forward algorithm, consider the precedence graph and 
the relative ordering of 6 jobs (in decreasing order of penalties) shown in figure 3. 

Initially, all jobs are unscheduled, and the job with the largest penalty is job 5 (i.e., 
7*(1) = 5). This job has two unscheduled ancestors, jobs 1 and 2, i.e., A5(5i) = {1, 2). 
The first iteration schedules the ancestors in positions 1 and 2, and schedules job 5 in 
position l\ = 3. At the second iteration, job 6 has the largest penalty among all remaining 
jobs. Its unscheduled ancestor, job 3, is assigned to position 4, while job 6 is scheduled 
in position 5. In the final iteration, the only remaining job (job 4) is scheduled in the last 
position. Table 1 summarizes these computations. 


3.1b Data structures and computational complexity: To perform the Forward algo¬ 
rithm’s computations efficiently, we maintain a special data structure, and make some 
minor algorithmic changes. Our implementation first sorts the jobs in decreasing order of 
penalties prior to initiating the main algorithm. This sorting operation requires O (n log n) 
effort, and will facilitate the process of identifying the bottleneck job at each iteration of the 
main procedure. Also, our implementation does not determine the exact positions for the 
unscheduled ancestors in the set Aj* (r) (Sr) (i.e., it does not perform step 1 (c)) immediately 
after each iteration. Instead, we reindex these jobs temporarily, and determine the actual 


Table 1. Forward zero-base algorithm iterations for example. 


Iteration 

k 

Unscheduled jobs 

Sk 

Bottleneck job 

r(k) 

Unscheduled ancestors 
Aj*{k) (Sk) 

1 

1,2,3,4, 5,6 

5 

1,2 

2 

3, 4, 6 

6 

3 

3 

4 

4 

— 


Optimal schedule: 1-2-5-3-6-4 
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final schedule at the end of the main algorithm by performing an overall sorting operation. 
Initially, all jobs have a temporary index of 0. At iteration r, we assign the temporary index 
(nr + J) to each job J € Aj*(,-)(Sr) that is scheduled during that iteration, and job j*(r) is 
assigned the index (nr + j*(r)). Thus, all jobs that must be scheduled in the rth iteration 
(between positions (/,-_] + 1) and (//• — 1)) have temporary indices in the range («r + 1) to 
n(r After the main algorithm terminates, we sort the jobs in increasing order of their 
temporary indices (0(n log n) effort) to obtain the final optimal schedule. Observe that the 
largest possible value of a temporary index is n(n + 1). Also, at intermediate iterations, all 
the jobs that have not yet been scheduled are easy to identify since they have temporary 
indices of 0. 

Let us now analyse the computational complexity of the Forward algorithm. First, iden¬ 
tifying the successive bottleneck jobs involves sequentially scanning the sorted list of jobs, 
which requires 0(n) total effort. (At step r, the rth bottleneck job j*(r) is the first job 
following 7 * (r — 1) in the sorted list with a temporary index of 0.) Now, consider the effort 
required to identify the unscheduled ancestors of job j*(r) (in step 1(a)). Starting with 
job j*(r), we trace back all unscheduled ancestors using the pointers to the immediate 
predecessors in the linked list representation of the precedence graph. If we encounter a 
previously scheduled job, we need not explore its ancestors since these ancestors must ail 
be scheduled previously. Thus, the total effort required to identify the members of the set 
Aj*(r) (Sr) is O (m) over all iterations (since we examine each edge in the precedence graph 
exactly once). Combined with the initial and final sorting operations, we get an overall 
complexity of O (m -(- /i log n). In general, the number of arcs m in the precedence graph is 
0(n^y, hence, the Forward algorithm is no better than Lawler’s original algorithm in the 
worst-case. However, for problems with sparse precedence graphs, we expect the Forward 
algorithm to perform better. 

The Forward algorithm also extends to the more general 1 /prec, rj, pmtn//max problem 
where jobs have different release times rj that are known in advance, and preemption 
is permitted. Appendix 1 describes this extension. Later (in § 5), we use the schedule 
generated by this enhanced method as the benchmark to evaluate the effectiveness of 
on-line reoptimization when job release times are not known in advance. 


3.1c Adapting Lawler’s algorithm for consistent penalty functions: Note that when the 
penalty functions are consistent, we can also adapt Lawler’s original 0(n^) algorithm 
for l/prec//max to run in 0(m n log/i) time. Recall that, at each stage k, for k = n, 
n — 1,..., 1, Lawler’s algorithm selects the eligible job j e Rk with the smallest penalty 
at the current completion time 7^. For general penalty functions, the order of eligible jobs 
(arranged in increasing order of penalty values at Tk) might change from stage to stage. 
However, with consistent penalty functions, the order is invariant. To exploit this property 
we use a heap structure to store the currently eligible jobs in sorted order (increasing 
penalties) at each stage. At stage k, we - (i) schedule the job that is currently at the 
root of the heap, (ii) delete this job from the heap, and (iii) insert in the heap all its 
immediate predecessors that just became eligible. Inserting and removing each job from 
the heap entails 0(n logn) total effort; and, checking the eligibility of jobs at each stage 
requires 0(m) effort (since each arc of the precedence graph must be examined once). 



Hence, the overall complexity of the heap implementation of Lawler’s static algorithm 
is 0(m + n log n), which is the same as the computational complexity of our Forward 
algorithm. However, as we show next, the Forward algorithm is more amenable to updating 
existing schedules. 

3.2 The Forward updating algorithm 

For the updating problem, we are given an optimal n-job schedule, and a new job, indexed 
as (n + 1), with prespecified immediate predecessors 5„+i, arrives. We initially assume 
that job {n + 1) does not have any successors in the current set of n jobs; later, we indicate 
how to apply the updating method when the new job also has successors among currently 
scheduled jobs. Let FT = [ji, ji, h-> ■. jn\ denote the current optimal n-job schedule 
where jj^ denotes the index of the job that is scheduled in the k\h position. The updating 
problem consists of constructing a new optimal schedule H' = {j[, jn, jn+\} that 

includes the new job (« + 1). 

The updating procedure uses information on the bottleneck jobs corresponding to the 
current schedule. As we mentioned in §3.1, the successive bottleneck jobs must have 
successively lower penalties; hence, the current sequence FI already lists the bottleneck 
jobs in order of decreasing penalties. Consider now the position for job (n + 1) in the 
(n + l)-job optimal schedule H', assuming we applied the Forward zero-base algorithm. 
Since job (n-f 1) does not precede any current jobs, its position in the schedule is determined 
solely by its penalty function relative to the current bottleneck jobs. In particular, suppose 
fj*{r-i) < fn+i < t.Q., the new job’s penalty lies between the penalties for the 

(r — l)th and rth bottleneck jobs. Since the current sequence schedules existing bottleneck 
jobs in decreasing penalty order, we can identify the index r in linear time. And, the 
updating procedure must merely insert job (n -\- 1) and all its previously unscheduled 
ancestors (i.e., ancestors that are not scheduled in positions 1 to Ir-i) immediately after 
the (r — l)th bottleneck job j*{r — 1). 

Assuming that we are initially given only the immediate predecessors 1 of job (w -f 1), 

finding the members of A„+i (5r) (the set of unscheduled ancestors for job (n +1)) requires 
O (in') operations, where m' is the number of edges in the precedence subgraph induced 
by job (n -t- 1) and its ancestors. We must then assign these ancestors to consecutive 
positions, starting with position (/^-i -j- 1). Observe that the current schedule satisfies the 
precedence constraints among all ancestors of job (n -t-1). Hence, the jobs in A„+] (Sr) 
need not be re-sorted to satisfy precedence constraints; instead, we get a feasible schedule 
by merely scheduling these ancestors in the order in which they occur in the current 
schedule. Adjusting the current schedule in this manner requires at most 0(n') effort, 
where n' is the number of ancestors of job (n -F 1). Thus, the overall complexity of the 
Forward updating procedure is O (m' + n') compared with 0(m-\-n log n) for the Forward 
zero-base algorithm. Finally, note that the updating procedure can easily accommodate 
new jobs that must precede existing jobs. Let js be the immediate successor of job (n-\-l) 
that is scheduled earliest in the current schedule, and let Is denote its position in the current 
schedule. In the new schedule, the jobs that are currently scheduled in positions Is and 
beyond will retain their relative order. Therefore, we need to apply the updating procedure 
only to jobs that are currently scheduled in positions 1 to - 1. 



Ranking of jobs in order < i 7 7 a 
of decreasing penalties ; - ^ * 3 '3 b, A, 2, 4 

Updated optimal schedule: 1 - 2- 5- 3- 4>7-6 

Figure 4. Example for Forward updating algorithm. 

3.2a Example: For the example shown in figure 3, figure 4 illustrates the updating 
calculations when a new job (job 7) enters the system. This job has two immediate prede¬ 
cessors, jobs 4 and 5, and its penalty value lies between those of jobs 1 and 6 . Job 7 must, 
therefore, be scheduled between the two consecutive bottleneck jobs 5 and 6 . Job 7 has two 
ancestors, jobs 3 and 4, that are not scheduled prior to job 5. Hence, we assign these two jobs 
to positions 4 and 5, respectively (preserving their relative order in the current schedule); 
job 7 occupies position 6 , followed by job 6 . Figure 4 shows the updated schedule. 

4. Computational results 

Section 3 showed that the updating algorithm has better worst-case complexity than the 
zero-base algorithm. To verify this computational superiority in practice, we compared the 
computation times using the Forward zero-base algorithm and the updating procedure for 
an extensive set of random test problems ranging in size from 100 jobs to 400jobs. We first 
describe the random problem generating procedure before presenting the computational 
results. 

4.1 Random problem generation 

Our random problem generator requires two user-specified parameters: the number of 
jobs («), and the density <5 of the precedence graph (0 < 5 < 1). Initially, we attempted 
4 to generate precedence graphs with random topologies by independently selecting arcs 
(/, j), for any pair of nodes i and j, with probability 5. We discovered, however, that the 
resulting precedence graphs contained many redundant arcs. For example, if the graph 
contains arcs (i, j), ( 7 , k), and (/, k), then arc (i, k) can be deleted since this precedence 
order (i.e., i preceding k) is implied by the other two arcs. Consequently, the reduced 
graph (with redundant arcs eliminated) was often much sparser than the desired density 
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values. To overcome this problem and to avoid checking for redundancies, we decided to 
use layered precedence graphs that contain only arcs between successive layers; hence, 
none of the arcs are redundant. 

To generate a random layered graph containing n nodes and with density parameter 8, 
the problem generator 

• randomly selects the number of layers (L) in the graph (0 < L < n); 

• equally divides the number of nodes among the layers; and, 

• for each pair of nodes i and j in successive layers, selects arc (i, j) with probability 8. 

For the updating problem, the random generator assigns the new job (n +1) to a new layer, 
and connects node (n + 1) to nodes in the previous layer with probability 8. 

We implemented the zero-base and updating versions of the Forward algorithm in 
PASCAL on a Sun 4/390 workstation. For our computational tests, we considered four 
network sizes, with number of nodes n = 100, 200, 300, and 400, and five values of the 
density parameter 5 = 0.10, 0.25, 0.50, 0.75, and 0.90. For each combination (n, 8), we 
generated 100 random problem instances. Table 2 summarizes the mean (over 100 random 
instances) and standard deviation of CPU times (to add the (n + l)th job to the current 
schedule) for the zero-base and updating versions of the Forward algorithm for all the 
(n, 5) combinations. 

As table 2 shows, the updating algorithm is faster than the zero-base version by a factor 
ranging from 2 to 5. Thus, for scheduhng contexts that require numerous, frequent updates, 
the magnitude of computational savings using the updating method can be substantial. 
For the 100-job problems, the CPU time for individual problem instances varies widely 
as indicated by the large standard deviation (relative to the mean). As the problem size 
increases, the CPU time for updating relative to zero-base scheduling appears to increase. 
The density parameter does not seem to have a significant or consistent effect on this ratio. 


Table 2. Computation times for updating versus zero-base algorithm. 
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0.90 

24.9^ 

5.0 

76.7 

25.3 

141.6 

58.6 

222.0 

102.4 


(10.7)§ 

(7.7) 

(13.0) 

(8.9) 

(19.6) 

(9.0) 

(27.8) 

(12.6) 

0.75 

27.6 

(7.8) 

6.7 

(8.3) 

76.3 

(12.1) 

23.4 

(9.5) 

138.7 

(19.6) 

58.3 

(9.7) 

219.0 

(24.3) 

103.4 

(8.9) 

0.50 

24.6 

(8.3) 

5.7 

(8.0) 

73.9 

(14.3) 

29.9 

(11.2) 

132.7 

(17.5) 

56.0 

(8.7) 

212.1 

(23.8) 

100.0 

(7.5) 

0.25 

23.7 

(9.5) 

7.7 

(8.4) 

66.3 

(10.4) 

26.4 

(8.9) 

124.7 

(11.7) 

55.7 

(9.4) 

200.0 

(22.6) 

102.3 

(10.1) 

0.10 

24.4 

(9.0) 

5.7 

(8.1) 

66.3 

(11.8) 

25.3 

(8.2) 

123.9 

(12.1) 

58.1 

(10.2) 

186.0 

(15.5) 

104.0 

(13.8) 


^ Average CPU time (in milliseconds on a SUN 4/90) over 100 random problem instances; 
§ Standard deviation of CPU times 
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5. Competitiveness of on-line reoptimization 

Having demonstrated the relative efficiency of using tailored updating methods instead 
of zero-base algorithms to accomodate new jobs, we now examine the effectiveness of 
on-line reoptimization as a heuristic strategy to schedule dynamic systems. One approach 
to evaluate this effectiveness is to assume a tractable stochastic model for job arrivals, 
processing times, and precedence relationships, and analyse the expected performance of 
on-line reoptimization (or other dispatch rules) in this framework. However, this mode of 
analysis is often sensitive to the choice of the stochastic model governing the occurrence 
of random events. 

Recently several researchers in theoretical computer science (e.g., Borodin et al 1987, 
Manasse et al 1988, Chung et al 1989) have developed an alternate approach to study 
on-line effectiveness using the notion of competitiveness. The approach involves charac¬ 
terizing the worst-case performance of the on-line method compared to an optimal off-line 
procedure that has perfect information about the future. In particular, for problems with a 
minimization objective, an on-line algorithm A is said to be c-competitive if the inequality, 

Ca — ^ Co ■+■ Of, 

holds for any instance of the on-line problem. Here, Ca denotes the “cost” incurred by 
the on-line algorithm A, and Cq is the cost for the optimal off-line solution with clairvoy¬ 
ance. Thus, competitiveness is a useful measure for performance analysis of incremental 
algorithms. Researchers have started applying this measure to scheduling problems only 
recently; Shmoys etal (1991) address competitiveness issues related to on-line scheduling 
of parallel machines to minimize makespan. 

This section demonstrates the underlying principles and techniques of competitiveness 
analysis applied to our single machine scheduling problem. Developing bounds on the 
relative difference between the on-line and off-line optimal objective values for general 
penalty functions fj () is difficult since we cannot exploit any special properties of the 
solutions. We, therefore, need to separately study various specialisations of fjO- We con¬ 
sider two types of penalty functions - subadditive functions and lateness. For subadditive 
penalty functions, we show that on-line reoptimization is 2-competitive when we permit 
preemptions, and (p 4- 2)-competitive for non-preemptive scheduling, where p is the as¬ 
pect ratio (defined later). We then prove 2-competitiveness of on-line reoptimization for 
the lateness penalty case (delivery-time version). 

Before describing and proving the competitiveness results, let us clarify the context and 
the mechanics of on-line reoptimization. Jobs arrive randomly at various release times rj . 
Assume that jobs are indexed in the order in which they arrive. We study effectiveness for 
both preemptive and non-preemptive scheduling problems. First, consider the case when 
preemptions are permitted, i.e., at each arrival epoch rj, the job that is currently in process, 
say, job u can be interrupted and resumed later without any additional setup or reprocessing 
effort. In this case, applying the updating method involves: (i) determining the rank, say, 
r* (in decreasing penalty order) of the new job j relative to all the currently available 
jobs (including the current in-process job u), and (ii) inserting job j and its unscheduled 
ancestors immediately after the (r* — l)th bottleneck job in the current schedule. Notice 
that the current job u is preempted only if job j has a higher penalty than all other available 



jobs. In the non-preemptive case, job w must necessarily be completed first, and only the 
remaining jobs can be rescheduled. Hence, job j is ranked only relative to these remaining 
jobs. 

This on-line updating algorithm is a heuristic method that does not guarantee long-run 
optimality of the schedules. The benchmark for comparing the performance of the on-line 
method is an optimal off-line schedule that has prior knowledge (at time 0) about the exact 
arrival times rj for all jobs j. In Graham et aV^ (1979) nomenclature, the off-line schedule 
is the optimal solution to either the 1/prec, Vj, pmtn//max or 1/prec, rj/fmax problem 
depending on whether or not preemption is permitted. Note that the 1 /prec//max problem 
(with preemption) is polynomially solvable using, say, the enhanced Forward algorithm 
described in appendix A, while the 1 /prec, rj //max problem (without preemption) is known 
to be NP-hard. Indeed, the non-preemptive problem remains NP-hard even if we restrict 
the penalty /max to maximum lateness Lmax, and relax the precedence constraints (i.e., for 
the 1 /rjjLmax problem). 

5.1 Subadditive penalty functions 

A penalty function /• () is said to be subadditive if it satisfies the condition 
fj < fj (ti ) + /} (f 2 ), for all r 1 , ^2 ^ 0. 

Interesting special cases of subadditive functions include concave penalty functions, and 
linear completion time penalties (i.e., fj(t) = Bit). We denote the maximum subadditive 
penalty as /max- This section studies the competitiveness of on-line reoptimization, with 
and without preemption, when all jobs have consistent, subadditive penalty functions. 
When preemptions are permitted, we show that the on-line reoptimization strategy is 2- 
competitive, i.e., the objective value of the on-line schedule is at most twice the optimal 
off-line value. When preemption is prohibited, the worst-case ratio increases to (p + 2), 
where p is a specified ratio of job processing times. 

5.1a Scheduling with preemptions: We now show that, for any ^-job problem, the pre¬ 
emptive schedule obtained using on-line reoptimization (without anticipating future job 
arrivals) has a worst-case performance ratio of 2 relative to the optimal off-line schedule 
for the l/rj, prec, pmtn//^^x problem constructed by the enhanced Forward algorithm 
(appendix A). 

Theorem 1. For the 1 / prec, rj, pmtn/ problem, on-line reoptimization is 2-competitive. 

Proof Let H be the preemptive schedule obtained using on-line reoptimization for a 1^-job 
problem. Let fk be the (maximum) completion time penalty for this schedule, and let / 
denote the critical job, i.e., = fyity) = max{/,(ty): 1 < y < k], where tj is the 

completion time for job j in the schedule FI. First, we note several characteristics of the 
schedule FI. Since j' is the critical job, all jobs following job / in schedule FT must have 
lower penalty functions (otherwise, a job with higher penalty function that is scheduled 
later than j' would be the critical job). Consider now the interval of time [rj/, tj>] between 
the arrival of job j' and its completion. Let J' denote the set of all jobs j that are completed 
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in this interval and having equal or higher penalty functions {J' also includes job j). Let 
A{J') denote the set of ancestors j e J' of all jobs in the set J'. Clearly, every job that 
the on-line schedule completes in the interval [rjf, tj/] must either be a member of the set 
J' or an ancestor of some job j e J'. Also, the completion time tj> for job / in schedule 
n has the following upper bound: 

+ Pj‘ 

jeJ' j€A(J') 

Now consider the optimal off-line schedule n* which uses prior information on job release 
times. Let 0^ be the (maximum) completion time penalty for this schedule, and let 7) be 
the completion time for job j in O*. Among all the jobs belonging to the set J', let j" be 
the job that is scheduled last in FI*. Observe that 

(l>t>fj"{Tjn)>fy{Ty>), (2) 

Tj">y2pj+ L Pj, and (3) 

j&J' j&AW) 

Ty>ry. (4) 

Inequality (2) follows from the definition of <p^, and because job j" e J' has an equal or 
higher penalty function than job /. Inequalities (3) and (4) hold because both / and j" 
belong to the set J', and job j" is completed last in FI*. From (1), (3), and (4), we have 

tj/ Tj// -f- Tj// — 2Tj//. (5) 

Inequalities (2) and (5) imply that 


< fj/{2Tj//), from (5), 

< 2fj/{Tj/f), from subadditivity, and 

< 20^, from (2). 

Hence, the on-line reoptimization strategy is 2-competitive. 


Claim. The worst-case bound of 2 (proved in theorem 1) is tight. 

The following example justifies this claim. Consider a 3-job problem instance with linear 
completion time penalties fj (t) = fjt. Job 1 is unrelated, while job 2 precedes job 3. Jobs 
1 and 2 arrive at time 0, each requiring 10 units of processing time, and job 3 arrives at 
time 10 with a very small processing time. The jobs have the following ordering in terms 
of penalty functions: f3 > f\ > fi- In our notation, the following parameters describe 
this problem instance: ri = 0, r 2 = 0, and r^, — 10; pi = 10, p 2 = 10, and ps = £\ 
53 = {2}; and = 1, ^62 = 0, and = 100. 

At time 0, both jobs 1 and 2 are available, but job 1 has higher penalty. Hence, the 
on-line method processes it first. The next epoch is at time 10, when job 3 arrives (and job 
1 completes). At this time, the on-line method starts job 2 (to satisfy job 3’s precedence 

^ at timo (ICi _L c•^ TrvK ^ ic tVip pritipjil with a 
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method anticipates job 3’s higher penalty and delayed arrival at r = 10. Hence, the optimal 
off-line sequence is 2-3-1. Job 3, completed at (10 -f- e), is again the critical job, with 
a penalty of 100*(10 £)■ Thus, the ratio of on-line to off-line penalty values is (20 + 

£)/(10 s) which approaches 2 as £ approaches 0. Hence, the worst-case ratio of 2 is 
tight. Next, we show that, for the non-preemptive case, the worst-case ratio is higher. 


5.1b Scheduling without preemptions: As we noted earlier, finding the optimal off-line 
schedule with no preemption is computationally intractable. Hence, unlike the preemptive 
case, we do not have a convenient characterization of the optimal off-line schedule. To 
evaluate the competitiveness of on-line reoptimization for the non-preemptive case, we 
use the following strategy. We know that the optimal off-line preemptive schedule has a 
lower penalty than the optimal off-line non-preemptive schedule. Hence, if we can derive 
a worst-case ratio for the on-line, non-preemptive schedule with respect to the optimal, 
off-line preemptive schedule, this ratio should also hold for the off-line non-preemptive 
solution. Let p denote the maximum, over all job pairs i and j, of the ratio of processing 
time for job j to the sum of processing times for job i and all its ancestors, i.e.. 


p = max 


Pj 


pt+ L ^4 ■ 

ieA(i) I 


1 < i. j <k,i ^ j 


We refer to p as the Aspect ratio. Note that the denominator in the above expression is a 
lower bound on the earliest possible completion time of job i. Indeed, our competitiveness 
result applies even if we replace the denominator of p with a tighter lower bound (on job 
z’s completion time) involving, say, the release times of job i and its ancestors. 


Theorem!. For the l/prec, rj/f^^ problem, the on-line reoptimization method is 
max{3, (p 2)}-competitive. 


Proof. This proof is very similar to the proof for theorem 1. Let Unp be the on-line, 
non-preemptive schedule for a k-]ob problem. Let be the (maximum) penalty of this 
schedule, defined by the critical job /. Consider the interval of time [rjf, tj/] between the 
arrival of job j' and its completion in schedule Tlnp. Let u be the job that is currently in 
process in Unp when job / arrives, and let J' be the set of all jobs with equal or higher 
penalties than j' that are completed in the time interval [rp, tji]. Except for the in-process 
job u, all other jobs that are completed in this interval either belong to J’ or are ancestors 
of one or more jobs in 74 Let A(J') be the set of all ancestors j ^ J' for jobs in 74 As 
before, let H* be the optimal, off-line preemptive schedule; job y" 6 J' is scheduled last 
in n* among all jobs of 74 Tj denotes the completion time for job j in H*, and is the 
schedule’s penalty value. The job completion times in the on-line and off-line schedules 
must satisfy the following inequalities; 


tj' < rj! A- Pu-\- Yl> PF 

jeA{J') j€J' 


Tjff > max 


0' E Pj + JlPjh and 
;gA(J0 jeJ' J 


(7) 

( 8 ) 



( 9 ) 


>// max ry/ PJ ^ H PJ 

\ [ jeAiJ') JeJ' 

These inequalities imply that: 

<P7 = fr(‘r^ 

< fyiry) + fj'(ppj') + fr ( X! PJ + 12 ) 

\jeA(J') JsJ' / 
using subadditivity and (7) 

< 0^ + max{0j, yO0^} + 0^ using subadditivity and (9) 

= max{3, (p + 2)}0^. 

When p < 1, the competitive ratio is 3 and is otherwise no larger than (p + 2). Thus, the 
worst-case ratio for the on-line, non-preemptive schedule is at most max (3, (p - 1 - 2 )} relative 
to the optimal off-line, preemptive schedule. Hence, the on-line reoptimization strategy is 
at least max{3, (p -f- 2)}-competitive for the 1 /ry, prec, pmtn//fj^x problem. □ 

Claim. Assuming p > 1, the worst-case bound of max {3, (p -f 2)} for non-preemptive 
schedules is tight. 

To prove this claim, consider the following augmented version of the previous worst-case 
problem instance (described after theorem 1). In addition to the 3 jobs in that example, 
we have a fourth job that arrives at time r 4 = 0 , with penalty coefficient ^4 = 1 , and a 
processing time of p 4 = 20. Also, job 3 arrives at rs = 10 -f 5, for some small 5 > 0. Note 
that the aspect ratio p for this problem instance is p 4 /pi = 2. Consider, first, the schedule 
obtained using on-line reoptimization. Job 1 (or job 4) is scheduled first and completes 
at time 10. Since job 3 is not yet available, the method schedules job 4, followed by 
job 2 and finally job 3. Job 3 completes at time (40 -f £); it is the critical job with a 
penalty of 100* (40 + s). Contrast this on-line schedule with the following optimal, non- 
preemptive schedule: Job 2 starts at time 0 and completes at time 10; the processor is 
idle from time 10 to time (10 4 - 5); Job 3 starts at time (10 - 1 - <5), and completes at time 
(10 - 1 - 5 -f e), followed by jobs 1 and 4. Again, job 3 is the critical job, with a completion 
time penalty value of 100* (10 4 - 5 -|- e). Thus, the ratio of the on-line penalty and the 
optimal, off-line (non-preemptive) penalty approaches (p 4 - 2) = 4 as 5 and s tend to 
zero. 

5.2 The Lateness penalty function 

We now consider the Lateness objective Lmax* the penalty for job j is /y(ty) = 
tj — dj, where tj and dj are, respectively, the completion time and due date for job j, 
and Lmax = Max{/y(ty) \ j = 1, 2,..., n}. Our discussions focus on non-preemptive, 
precedence-constrained scheduling for this problem. The best off-line approximation al¬ 
gorithm for the 1 /ry/L^ax problem was developed by Hall & Shmoys (1992). They show 
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that, for the delivery-time formulation of this model (described later), a 4/3 approximation 
algorithm is possible using an enhanced version of Jackson’s earliest due date rule. Al¬ 
though Jackson’s mle can be applied on-line, the approximation techniques used by Hall 
and Shmoys sacrifice the on-line characteristic to achieve the tighter bounds. 

To study on-line competitiveness, we cannot work directly with the maximum lateness 
objective Lmax since some problem instances may have zero or negative optimal off¬ 
line Lmax values. (If the off-line Lmax is zero, the worst-case competitiveness becomes 
unbounded for any on-line algorithm that is even slightly suboptimal.) Instead of redefining 
competitiveness, we transform the Lmax problem to the following equivalent delivery time 
version (Potts 1980) which has a positive optimal value for all problem instances. 

5.2a The delivery time formulation: Let [dj] denote the job due dates, and let L” be a 
value greater than the largest due date. Now define the tail qj of job j as 

qj = K-dj. 

We interpret the tail qj as the time to deliver the job j after it is completed. Thus, the 
delivery time of job j is tj -1- qj , where tj is the completion time of job j . The objective 
of the scheduling problem now consists of minimizing the maximum delivery time over 
all jobs. The optimal schedule for this delivery time version also minimizes maximum. 
lateness. Observe that the penalty function /((/) = /)• -t- L — dj implied by the delivery 
time objective function is consistent (according to our definition in § 2), with jobs that 
are due earlier having higher penalty functions. Thus, at each job arrival epoch, on-line 
reoptimization for the delivery time problem (without preemptions) involves adjusting the 
current schedule to process the available job with the earliest due date as soon as possible 
(subject to completing the current in-process job and all unfinished ancestors of the EDD 
job). 

5.2b 2-Competitiveness of on-line reoptimization: For any given k-job instance of the 
non-preemptive delivery time problem, let n be the schedule obtained using on-line re¬ 
optimization. Let DTj be the delivery time of job j in this schedule; (pk is the maximum 
delivery time value over all jobs j. Denote the (maximum) delivery time of the optimal, 
off-line schedule FI* as 

Theorem 3. On-line reoptimization is 2-competitive for the non-preemptive delivery time 
problem, i.e., (pk < for all k. 

Proof We prove this result by induction. Assume that jobs are indexed in the order in 
which they arrive, and first consider k = 2. Since delivery time for a job equals the sum 
of its start time, processing time, and tail, the optimal off-line value 0^ must be greater 
than or equal to the maximum processing time over the jobs. In the two-job case, if both 
jobs arrive simultaneously, then the on-line updating method also constructs the optimal 
schedule. When the release times are different, say, job 1 arrives before job 2, the on¬ 
line method schedules job 1 before job 2. Suppose the optimal off-line solution consists 
of processing job 2 before job 1. Relative to this optimal schedule, the on-line schedule 



z s penalty in ir is a tower oouna on me optimal oir-iine oojective lunciion vame 
Hence, 


02 < 02 + /^1 ^ 02 maxfpi, P2} 

< 202 *. 

Thus, the on-line method is 2-competitive for 2 jobs. Now suppose the method is 2- 
competitive for all k'^{k — 1) jobs. We show that it must also be 2-competitive for k 
jobs. 

Let job / < /: be the critical job in the on-line schedule, i.e., <pk = DTy = sy -f- pjf-\-qjf, 
where sj is the start time for job j in the on-line schedule. 

Case 1. Sji < Sk-In this case, job j' starts earlier than the new job k but also has the 
highest delivery time. Hence, job / must have a higher penalty function (i.e., earlier due 
date) than job k. Since the Forward algorithm leaves all higher penalty jobs unaffected 
when job k arrives, job y'' must start at sy even in the (^ — 1) job on-line schedule. Hence, 
0/t-i > Sj> -f- pjf -f- qy = (pk. But, 0^_j < 0^, and 0^_i < 20J_^ by the induction 
hypothesis. Hence, (pk < 201". 

Case 2. sy > Sk -In this case, 

0j - O’' + Pf + 

= (sy -h py -f qy) - (sy - ry) 

= 0^ - (sy -ry). 

Note however that sj — rj < Ylj-i Pi all jobs j. 

Otherwise, the interval of time between the arrival and start of job j contains some idle 
time which is impossible using the on-line method (since job j is waiting in the queue). 
Therefore, 0^ > 0A: — Pi, which implies that pk < 20^. 

Thus, on-line reoptimization is 2-competitive. □ 

As before, we can show that the worst-case ratio of 2 (proved in theorem 3) for on-line 
reoptimization is tight. Indeed, for the non-preemptive delivery time problem, the following 
example (Kise & Uno 1978; Potts 1980) shows that any on-line scheduling algorithm that 
does not introduce forced idleness (i.e., does not keep the machine idle when the job queue 
is not empty) must have a worst-case ratio of at least 2. Consider a problem instance with 
two jobs that are released respectively at ri =0 and r 2 = 1, having processing times 
Pi = (P — 1) and p 2 = I, and due dates d\ = P and <^2 = 1 (hence, the tails are = 0 
and ^2 = (P — !))• With no precedence constraints, the best strategy consists of keeping 
the machine idle for the first time period, and scheduling job 2 before job 1, The maximum 
delivery time for this solution is (P -f 1). On the other hand, any on-line method that does 
not anticipate future job arrivals or introduce forced idleness will schedule job 1 at time 0 
(since it is the only available job). Since jobs cannot be preempted, job 2 begins processing 
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only at time (P — 1), and its delivery time is (2P - 1). As P becomes arbitrarily large, the 
ratio of on-line to optimal off-line delivery time approaches 2. And, on-line reoptimization 
achieves this lowest possible worst-case ratio. 

In retrospect, the 2-competitiveness of on-line reoptimization is not surprising in view 
of the worst-case bound of 2 for Schrage’s heuristic (Potts 1980). For the l/rj/DT^ax 
delivery time problem (without preemption or precedence constraints), Schrage’s heuris¬ 
tic consists of applying Jackson’s earliest due date rule on-line (with a longest processing 
time tie-breaking mle), i.e., whenever a job completes, the method dispatches the cur¬ 
rently available job with the earliest due date. Note that, without precedence constraints, 
on-line reoptimization also chooses this “current” FDD job sequence. Potts (1980) used 
a characterization of Schrage’s heuristic schedule (in terms of an interference job) to 
prove that the method has a worst-case bound of 2. When jobs have precedence con¬ 
straints, we can transform the delivery time problem 1/ry, prec/DPmax to an equivalent 
unconstrained version by revising the job release times and tails as follows: If job i must 
precede job set rj ^ max{rj, rj} and qi <- max{^i, + pj). Potts’ result implies 
that Schrage’s heuristic applied to this transformed problem is 2-competitive. Note that, 
when a new job arrives, updating the tails (to account for its precedence constraints) 
involves examining every currently available ancestor of the new job. In contrast, our on¬ 
line updating algorithm (appendix A) is more efficient since it first locates the new job 
relative to the current bottleneck jobs, and only examines ancestors that are scheduled 
later. 


6. Conclusion 

In this paper we have developed a new Forward algorithm and an updating version for 
one class of scheduling problems. The updating procedure reduces the computational 
effort to accomodate a new job into an existing schedule by using information from the 
current schedule. In contrast, applying a zero-base algorithm to reschedule all the jobs 
from scratch would entail significantly higher computational effort, as illustrated by our 
computational results of § 4. Section 5 gives partial results characterizing the effectiveness 
of using deterministic reoptimization for on-line scheduling. 

Our broader purpose in this paper is to demonstrate the scope, and efficiency and 
effectiveness issues in developing on-line updating algorithms for dynamic scheduling 
problems. In spite of their practical importance in contexts such as real-time control 
of distributed processors, updating algorithms have not been adequately studied in the 
scheduling literature. For illustrative purposes, we studied a single-machine scheduling 
problem that can be solved efficiently. Exploring similar updating methods for other 
scheduling objectives and contexts is an important research direction that merits further 
investigation. 
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Appendix A. Forward algorithm for the i/prec, ry , pmtn/ /max problem with 
consistent penalties 

The Forward algorithm described in § 3 assumes that all jobs are simultaneously available 
at time 0. This appendix describes an extension to handle arbitrary, but known, job release 
times; job preemption is permitted. 

First, we review the notation. We are given «jobs, and a precedence graph G : {N, A) 
containing m arcs. Bj and Aj represent, respectively, the set of immediate predecessors 
and ancestors of job j . Each job has a “consistent” non-decreasing penalty function fj (r), 
i.e., either fj{t) < fi(t) or fj(t) > /(r) for all completion times t, and fj(t) > fj{t') if 
t > t'. Let pj and rj denote, respectively, the processing time and release time for job j. 

Without loss of generality, we assume that all ancestors of any job j arrive at or before 
job 7 ; otherwise, we can set rj = Max{r/ -b pi ■. i e Bj). Also, for convenience, assume 
that jobs are indexed in order of release dates; consequently, i < j if job i precedes job j. 
We require a preemptive schedule that minimizes /max = Max{/(r 7 ) : j = \,2, ... ,n] 
while satisfying the precedence constraints and release times, where tj is the completion 
time for job j in the chosen schedule. 


Scheduling principle As before, the Forward algorithm identifies successive bottleneck 
jobs, and schedules each bottleneck job as early as possible. As before, we first sort all 
jobs in decreasing order of penalties. The method starts with an empty schedule, and 
progressively assigns jobs to appropriate “free” time intervals in the current schedule. At 
iteration r, the method has scheduled the first (r - 1 ) bottleneck jobs and all their ancestors. 
Let F denote the set of available free time intervals in the current schedule at the start of 
iteration r. The following steps are performed at iteration r: 

Step 1. Identify the rth bottleneck job i.e, job j*(r) has the largest penalty among 
all the currently unscheduled jobs; 

Step 2. Consider each unscheduled ancestor j of job j*(r) in increasing index order: 
allocate to job j the first available pj time units after the release time rj. Update the 
available free intervals. 

Step 3. Allocate to job j*(r) the first available Pj*{r) time units after the release time 
rj*(r)- 

The algorithm iteratively repeats this process - finding the next bottleneck job, and 
scheduling this job and all its ancestors as early as possible in increasing index order. Note 
that as we identify and schedule more bottleneck jobs, the schedule becomes fragmented, 
i.e., it has “holes” due to delayed releases for certain jobs. These holes are filled whenever 
possible in subsequent steps; filling the holes might introduce preemptions, i.e., the total 
processing time of a job may be distributed over several intervals. 

Let n be the final schedule constructed by the Forward algorithm, and denote the set of 
bottleneck jobs as Jb- 


Lemma. In the final schedule constructed by the Forward algorithm, 
/max = Max{/;•(/•) ■■ j € Jb). 


Proof of correctness of the Forward algorithm. Suppose the schedule FI constructed by 
the Forward algorithm is not optimal. Let j* be the critical ioh that determines the penalty 
of this schedule, i.e., /max = /j*((/*)> suppose j* is the r*th bottleneck job, i.e., 
f Let n' be an optimal schedule; let denote, respectively, the 

maximum penalty and the completion time for each job j in this schedule. 

By the hypothesis,/j^ax < /max = /;*(?/*). Since penalty functions are non-decreasing, 
this inequality implies that job y * must be scheduled earlier in i.e., tp < tj* . Since the 
Forward algorithm schedules all bottleneck jobs as early as possible in order of decreasing 
penalties, tp can be less than tj* only if the schedule n' completes some previous bottleneck 
job (i.e., a bottleneck job with a higher penalty than job j* and scheduled before tj* in 
the Forward schedule n) on or after ry*. Let i*{r) for some r < r* denote this bottleneck 
job. Since tp^^^ > tj*, we must have f^^^ > fj*(r)(ij*)- However, since the job j*(r) has 
a higher penalty than j*, fj*(r)0j*) > //*(/*) == /max» contradicting the optimality of 
schedule H'. □ 


References 

Aho A V, Hopcroft J E, Ullman J D 1974 The design and analysis of computer algorithms 
(Reading, MA; Addison-Wesley) 

Baker K R, Lawler E L, Lenstra J K, Rinnooy Kan A H G 1983 Preemptive scheduling of a single 
machine to minimize maximum cost subject to release dates and precedence constraints. Open 
Res. 31:381-386 

Borodin A, Linial N, Saks M 1987 An optimal on-line algorithm for metrical task systems. Proc. 
of 19th ACM Symposium on Theory of Computing, pp 373-382 

Chin F, Houck D 1978 Algorithms for updating minimum spanning trees. J. Comput. Syst. ScL 
16;333-344 

Chung F R K, Graham R L, Saks M E1989 A dynamic location problem for graphs. Combinatorica 
9: 111-131 

Even S, Shiloach Y 1981 An on-line edge deletion problem. J. Assoc. Comput. Mach. 28: 1-4 

Graham R L, Lawler E L, Lenstra J K, Rinnooy Kan A H G 1979 Optimization and approximation 
in deterministic sequencing and scheduling: A survey. Ann. Discrete Math. 5: 287-326. 

Frederickson G N 1985 Data structures for on-line updating of minimum spanning trees with 
applications. SIAM J. Comput. 14: 781-798 

Frederickson G N, Srinivas M A 1984 On-line updating of degree-constrained minimum span¬ 
ning trees. Proceedings of the 22nd Allerton Conference on Communication, Control, and 
Computing, October (New York; IEEE Press) 

Hall L A, Shmoys D 1992 Jackson’s rule: Making a good heuristic better. Math. Open Res. 17: 
22-35 

Jackson J R 1955 Scheduling a production line to minimize maximum tardiness. Research Report 
43, Management Science Research Project, University of California, Los Angeles 

BCise H, Uno M 1978 One-machine scheduling problems with earliest start and due time con¬ 
straints. Mem. Kyoto Tech. Univ. Sci. Technol. 27: 25-34 



On-line maintenance of optimal machine schedules 


279 


Lawler E L 1973 Optimal sequencing of a single machine subject to precedence constraints. 
Manage. Sci. 19: 544-546 

Lawler E L, Lenstra J K, Rinnooy Kan A H G 1982 Recent developments in deterministic 
sequencing and scheduling; A survey. In Deterministic and stochastic scheduling (eds) 
MAH Dempster, J K Lenstra, A H G Rinnooy Kan (Dordrecht: Riedel) 

Malone T W, Pikes R E, Grant K R, Howard M T 1988 Enterprise: A market-like task scheduler 
for distributed computing environments. In The ecology of computation (ed) B A Huberman 
(Amsterdam: Elsevier Science) pp 177-205 

Manasse M S, McGeoch L A, Sleator D D 1988 Competitive algorithms for on-line problems. 

Proc. 20th ACM Symposium on Theory of Computing (New York: ACM Press) pp 322-333 
Overmars M H, van Leeuwen J 1981 Maintenance of configurations in the plane. J. Comput. Syst. 
Sci. 23; 166-204 

Potts C N 1980 Analysis of a heuristic for one machine sequencing with release dates and delivery 
times. Oper. Res. 28: 1436-1441 

Ramamritham K, Stankovic J A 1984 Dynamic task scheduling in distributed hai*d real-time 
systems. IEEE Software 1: 96-107 

Sahni S, Cho Y 1979 Nearly on line scheduling of a uniform processor system with release times. 
SIAM J. Comput. 8 : 275-285 

Shmoys D, Wein J, Williamson DP 1991 On-line scheduling of parallel machines, preprint 
Spira P M, Pan A 1975 On finding and updating spanning trees and shortest paths. SIAM J. 
Comput. 4; 215-225 

Tarjan R E1983 Data structures and network algorithms (Philadelphia, PA: Soc. Ind. Appl. Math.) 
Zhao W, Ramamritham K 1985 Distributed scheduling using bidding and focused addressing. 
Proceedings of the Symposium on Real-time Systems (New York: IEEE Press) pp 103-111 



Advances in discrete material handling system design 


SRINIVASAN RAJAGOPALAN and SUNDERESH S HERAGU 

Decision Sciences and Engineering Systems Department, Rensselaer Polytech¬ 
nic Institute, Troy, NY 12180, USA 
e-mail: [rajags,herags]@rpi.edu 

Abstract. The paper presents an outline of the research done in the area of 
facility layout and discrete material handling system design. The objective of 
this paper is to observe the trend in the literature in the area of manufacturing 
system design and predict the direction of future research in this area. The 
paper attempts to link the areas of facility layout design and material flow 
network design. A methodology for solving the integrated design problem is 
presented. An algorithm which aids in solving the combined pick-up/drop-off 
point location and material handling flowpath problem is outlined. 

Keywords. Facility layout; automated guided vehicles; material handling 
system; integrated system design. 

1. Introduction 

The key design aspects involved in a manufacturing company are product design and 
process design. Product and process design involve designing the product while taking 
into consideration the processes to be used to manufacture that product. Process design 
includes equipment selection, designing the machine layout and the material handling 
system. Two key aspects that influence the efficiency of a manufacturing setup are the 
facility layout and material handling system design and operation. This paper analyzes the 
trend in the literature pertaining to facility layout and discrete material handling systems 
and projects the direction of future research in this area. 

2. Facility layout design 

Depending on how the machines are laid out, a manufacturing system can be broadly 
classified as; 

(a) Traditional manufacturing system; 

(b) Cellular manufacturing system. 

In a traditional manufacturing system, the machines are arranged within a given space 
to minimize the total material travel. The cellular manufacturing system is based on the 
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concept that all machines that perform operations on parts with similar processing requii 
ments can be classified and grouped into cells (Heragu 1994). A cellular manufacturi 
layout tries to minimize material handling costs between cells and also between the im 
vidual machines in a cell. The different methods available for solving the layout problc 
are the quadratic assignment problem (QAP) and its variations, graph theoretic approach 
and meta-heuristics based on simulated annealing, genetic algorithms and tabu search. F 
a more detailed survey of the techniques available for modelling and solving the fac 
ity layout problem, the reader is referred to Kusiak & Heragu (1987), Heragu & Kusi 
(1991), Heragu (1992, 1997) & Hassan (1994). 

The earliest formulation for the facility layout problem involved setting it up as a Q4 
The objective function in this case was a quadratic function which calculates the total c( 
of transferring material between machines. The constraints for this model specify that ea 
site is assigned to only one machine and that each machine can occupy only one site. T1 
formulation includes binary variables which indicate whether or not a machine is assign 
to a given site. The implicit assumptions in this model are that all the cells are of eqi 
size and shape. The QAP is known to be NP-complete (Sahni and Gonzalez 1976). 
the past, attempts to solve the QAP optimally have focussed on transforming it into 
equivalent linear mixed integer model. The transformed model, which has a larger numt 
of variables and constraints than the QAP, was then solved using branch and bound a 
cutting plane algorithms. These algorithms were good for solving problems involving 
maximum of 10 to 15 facilities. 

Exchange algorithms based on the 2--opt and 3-opt methods have also been used to sol 
the layout problem. These algorithms are based on the principle of exchanging the positio 
of 2 or 3 facilities and checking the effect of this exchange on the objective function, 
an improvement in the objective function is found, then the new layout is selected. T 
drawback in these methods is that they do not consider all the possible layouts that ex 
in the solution space and therefore are sub-optimal algorithms. These algorithms need 
initial starting point which can be any feasible solution. The quality of the final soluti 
depends upon the initial solution. However, these methods allow the user to generate a fi 
good layouts from which the final layout can be selected. 

The shift in paradigm and focus on cellular manufacturing systems led to a focus 
an additional area of research, namely machine grouping. The design problem is divid 
into two parts - (a) Grouping of machines into cells, and (b) layout of the cells. In order 
generate an overall optimal layout, a solution approach which simultaneously determii 
the grouping of machines and the layout of the cells is desirable. However, due to I 
complexity in modelling and solving such a combined problem, most approaches atten 
to solve the two problems sequentially. For a more detailed explanation and a literati 
review of the cellular manufacturing system layout problem, the reader is referred to Cl; 
(1995). 


3. ‘ Material handling system design 



283 


Advances in discrete material handling system design 


material handling equipment to departmental moves and developing the flowpath for the 
ystem. The types of material handling devices to be used, are mainly dependent on the 
ize of the company, the types of operations taking place and amount of space available 
n the lactory floor. Smaller companies prefer to use more traditional material handling 
levices such as forklifts or trucks, sometimes in conjunction with pick and place robots. 
\ssembly operations and line-balancing operations tend to use conveyors because they 
illow the workers to perform the necessary operations on the part while it moves from one 
station to another. Automated storage/retrieval systems (AS/RS) are used in warehousing 
:)perations. Industries that are involved in sorting and packaging type of operations tend 
to use palletizers and sortation conveyors. The current trend seen in large manufacturing 
companies is the employment of automated guided vehicles (AGVs) as material handling 
devices. AGVs are preprogrammed trucks that are controlled through a central computer. 
The AGVs go through a predetermined route and make stops at designated machines, 
where the material is either picked up or dropped off by other automated material transfer 
devices. Because AGVs are expected to be the dominant material handling device in future 
manufacturing systems, this paper focusses on the use of AGVs in the material handling 
function. 


An AGV system comprises 3 parts, namely the vehicle, the guidance system and the 
control system. The vehicle in itself is a driverless unit with a built-in charging system, an 
on-board controller, a communication unit and a frame. The guidance system consists of a 
guidepath and a tracking system. The guidepath could be either a physical metal wire used 
for tracking (active tracking) or a wireless system which detects the vehicle based on optical 
or wireless metal detection techniques (passive tracking). The control system consists of 
the controller which brings together the vehicle and the tracking system. It acts as the 
brain for the entire system. The control system also takes upon the task of route selection, 
blocking and tracking loads. An AGV system has several advantages among which are 
reliability, automated operation, reduced material damage during transfer, ease of change 
in the layout and increased efficiency of the entire manufacturing system. It also offers 
the user real-time control of the material handling system and on-the-spot decisions such 
as change in routing or assignment of AGVs to specific moves. The main disadvantage of 
AGVs are their high cost. They also need polished floor surfaces and removal of obstacles 
in the flowpath before they can be put into operation. 

A control classification scheme for an AGVS is presented by Peters et al (1995). A cubic 
classification structure that indicates the complexity of the problem based on guidepath 
determination, vehicle capacity and vehicle addressing mechanism is presented. Details 
of the levels and sublevels that partition an AGV system are also presented. This paper 
concentrates more on the control aspects of an AGV system. Sinreich (1996) andHeragu 
& Rajagopalan (1996) discuss in detail the research done in the area of AGV flowpath 
design. Sinreich (1996) gives an overview of the design models that have been developed 
for discrete material flow systems. Heragu & Rajagopalan (1996) present a classification 
scheme which can be used as a tool in identifying the various aspects involved in developing 
a flowpath for an AGV based system and also discuss the merits and demerits of all the 
models developed so far. 

The classification scheme for the AGV flowpath design problem presented in Heragu 
& Rajagopalan (1996) is based on: 
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Type offlowpath - traditional, single loop, tandem layout, loop layout and terminal AGV 
flowpath; 

Travel direction - unidirectional, bi-directional; 

Solution techniques - mathematical programming models, rule based solutions, heuristic 
solution techniques, simulation methods, mixed approach (i.e. combined simulation and 
analytical approach); 

Assumptions made - layout given, pick-up/drop-off points given; 

Objective function - total loaded travel time, travel distance of loaded vehicles, total travel 
distance, maximum travel distance, total path distance, overall cost. 

The main categories for classification are the type of flowpath assumed in the problem, 
the type of AGV flowpath design required - i.e., the travel direction of the AGVs, the 
solution techniques used to solve the models developed, the assumptions made while 
developing these models and the objective function used in them. 


4. AGV flowpath design 

Maxwell & Muckstadt (1982) first identified the problem of AGV system design. They 
tackled the problem of identifying the number of AGVs required for a system assuming that 
the guidepath was already laid out. The first models developed to solve the layout problems 
were mathematical programming models which tried to minimize the total travel distance 
of loaded vehicles for a given layout. Branch and bound techniques were developed to 



Figure 1. A schematic showing a traditional AGV flowpath design. 
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blems. The traditional flowpath problem was also modelled as a graph 
ved. Figure 1 shows a schematic of a traditional flowpath. A graph network 
ck-up/drop-off points as nodes and the flowpaths as ai'cs. Pick-up/drop- 
;s are those points in a machine or a cell where the material handling 
unloads the material required for that machine or cell. Later formulations 
ze the total loaded and unloaded travel time for a given AGV system, 
approaches to solving the flowpath problem also include formulating it as 
iity flow problem. The solution methods to all these problems are mainly 
h are easy in solving small problems. Rule-based solutions are those which 
a problem and cannot be generalized. The rules generated simplify the 
consideration and reduce the size of the solution space. However most of 
become impractical when solving large, real life problems (i.e. problems 
20 facilities). Kouvelis et al (1992) developed a set of heuristics to solve 
ath problem. They presented 5 different heuristic approaches to solve the 
lowpath problem including a simulated annealing based approach. Usher 
esented a two-phase iterative heuristic to solve the unidirectional layout 
^termination of P/D points. The first phase develops a flowpath based on a 
nd in the second phase the P/D points are determined based on minimizing 
tween any two stations using m one-median problems. 

/elopment in AGV flowpath design came with the concept of an optimal 
jure 2). This concept developed by Tanchoco & Sinreich (1992) is an al- 
; traditional flowpath design. An optimal single loop involves connecting 
nts with the smallest possible single unidirectional loop. The simplicity of 
iminates problems such as collision, blocking, and other routing conflicts, 
ies to find the best single loop guidepath and locates the P/D points along 








decided based on a set of rules optimizing the desired performance criteria. Sinreich & 
Tanchoco (1991) also developed other heuristic-based and rule-based methods to solve 
the P/D point location and optimal single loop layout problem. They developed a centroid 
projection method which considered both inter-departmental and intra-departmental flows 
(Sinreich & Tanchoco 1991). A genetic algorithm based method was developed by Baner- 
jee & Zhou (1995) which designed the facility layout for a single loop-based material flow 
network. 

The loop layout design was formulated to ease the problem of designing the facility 
layout and the material-handling system. The wide application of such a layout also helped 
in increased research in this area. The loop layout consists of machines being arranged in a 
loop so as to facilitate material handling. In these problems, the material-handling system 
is assumed to be known and in most cases is either a unidirectional AGV or a conveyor. The 
problem then reduces to that of a single row layout problem with certain added constraints 
that machines may not be placed near the edges. Most solution techniques developed to 
solve these problems are either heuristic-based or rule-based methods. 

A concept which simplifies the material handling system design even further and makes 
the control aspects of an AGV somewhat easier is the tandem layout. Developing a tandem 
layout involves grouping machines into cells and having a single loop AGV for each cell. 
Material transfer stations aid in transferring material from one cell to another. Bozer & 
Srinivasan (1991) first discussed the concept of a tandem layout. They also developed a 
strategy for partitioning the shop floor into appropriate cells and developed a facility layout. 
The tandem layout assumes that each cell is serviced by a single unidirectional AGV. This 
design increases the chance of the entire production line being disrupted if any of the 
AGVs breaks down. Performance evaluation of a tandem layout versus the conventional 
AGV system has been done using petri-net models and simulation. The studies conclude 
that a tandem layout, under normal operating conditions, performs just as efficiently as 
a traditional layout and is also easier from a control perspective. Wang & Hafeez (1994) 
proposed a terminal-based AGV system, where the AGVs are located in a terminal and on 
receiving a signal, the first available AGV goes to the pickup station along a preassigned 
path and then proceeds to the dropoff point. On completion of the task, the AGV returns to 
the main terminal. Variations of the terminal based AGV system were developed by Johnson 
& Brandeau (1995). They used the principles of queuing network theory to evaluate their 
system. 

Apart from the different types of layouts and flowpaths discussed above, the area of 
AGV flowpath design also considered the travel direction as an important parameter. Most 
studies assumed that the travel direction of the AGV was unidirectional, i.e. it could travel 
in one direction only. Egbelu & Tanchoco (1986) conducted one of the earliest studies on 
the merits and demerits of using bi-directional travel as opposed to unidirectional travel. 
The problem of conflict routing and collision avoidance is more acute in the case of bi¬ 
directional AGVs. The solution methods used to solve this problem include techniques 
such as column generation (Krishnamurthy et al 1993), simulation (Faraji & Batta 1994), 
graph theoretic approaches (Dowsland & Greaves (1994)) and expert systems (Dhiouib & 
Kadi (1994). The scheduling of AGVs has also been studied widely by researchers (Ulsoy 
& Bilge 1993; Blazewicz et al 1994). The different scheduling rules, their relationship with 
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the routing problem and the effect of these on the overall operation of the manufacturing 
system has been researched using techniques such as simulation and queuing theory. 


5. Integrated design process 

In the current literature, the most common assumptions made when solving the flowpath 
problem are that the layout is already given and that the P/D points are already determined. 
The papers that deal with the loop layout problem assume that the material handling system 
has already been designed and then solve the layout problem. The tandem layout structure 
simplifies the flowpath problem, but relatively more AGVs are needed for a tandem layout 
than for a conventional layout and therefore it becomes a somewhat expensive proposition. 
Also, a breakdown of one of the vehicles in the system could bring the entire production 
to a halt. 

The current manufacturing design process involves a sequential process where the initial 
stage involves process selection and equipment selection. This is followed by the layout 
design phase. Once the layout has been decided, the P/D points are selected for this layout 
and then a material handling system is designed to minimize total travel time for the given 
layout and P/D point locations. In such a case, the system that is designed may not be 
optimal overall although each phase in itself might be optimal. In other words, the AGV 
flowpath designed is optimal for the given layout, but the combined layout and flowpath 
may not be globally optimal. In order to overcome this, a model which simultaneously 
solves the flowpath and the layout must be developed. However, this combined problem 
is known to be NP-complete (Banerjee & Zhou 1995) and solving such a model may not 
be possible. 

The area of integrated design has been tackled by a few researchers. Chajjed et al (1992) 
solved the problem of identifying the P/D points and developed a free-flow material¬ 
handling, network assuming the block layout was already known. Wu & Egbelu (1994) 
tackled the problem of concurrently determining the layout and material-handling design 
by using a multistage approach. They use a loop-layout model and solve the problem 
of locating the cells within the loop. In this case the material-handling flowpath design 
problem is a trivial problem which translates into determining the P/D points and layout 
along a loop. Palliyil & Goetschalckx (1996) used a dual simplex-based approach to 
solve the problem of determining the aisles and load stations for an aisle-based material¬ 
handling system. They assume that the layout is already known. Other works which tackle 
the problem of integrating two aspects of a manufacturing design problem include the 
combined layout and P/D point location problem (Heragu 1990), flowpath design and 
load size determination (Egbelu 1990) and equipment selection in conjunction with the 
load size determination (Noble et al 1994). 

Integrated design problems are part of the current area of research which is being widely 
pursued. Heragu and Rajagopalan (1996) present a framework for developing a near- 
optimal-solution to the manufacturing system design problem (figure 3). The framework 
involves a multi-stage solution procedure. The proposed strategy is a three-stage methodol- 
02 V that should result in an optimal or 20 od solution depending on the size of the problem 




Figure 3. A flowchart describing the methodology used for the integrated design process. 


the principles of cellular manufacturing system. The machines are grouped into cells and 
both the machine layout within a cell and the layout of the cells are decided. The material¬ 
handling system for intra-ceU material flow is decided based on the machines inside the 
cell and the operations being performed (figure 4). The P/D points for the cells are also 
determined. 

The layout and the P/D point location can be combined if necessary by using models 
similar to those developed by Chajjed et al (1992). The second stage involves solving the 
flowpath problem for the cells based on the layout and the P/D points developed. The next 
stage which is the most important stage in this process involves evaluating the performance 








Figure 4. Schematic to show inter cell and intra cell material handling movement. 


of the manufacturing system developed based on a predefined set of criteria. A feedback 
loop is required to ensure that the results obtained from the third stage are communicated 
to the first stage. The layout is then suitably altered and the process continues until all the 
criteria are met. The third stage which calculates the “goodness” of a solution consists of 
either a queuing-based model or a simulation model. 


6. Problem formulation 

In the case of bi-directional AGVs (where the AG Vs can travel in both directions), the 
flowpath design problem can be formulated as an undirected graph network problem. The 
P/D points are treated as nodes and the possible flowpaths connecting two P/D points are 
modelled as arcs. The problem of developing a flowpath is now equivalent to finding the 
shortest path between any two nodes. The assumptions made in this analysis are: 

(1) aisle spaces are known and AGV guides are laid along the aisles; 

(2) material transfer rate ( or the from - to matrix) is known; 

(3) number of AGVs required is already known. 

The objective function of the problem can be changed to minimizing the total guidewire 
length that is to be laid and the problem can be solved as a minimum spanning tree problem. 
However, this approach assumes that the P/D points are already known. In order to solve 
the P/D point location and the flowpath problem concurrently, one must assume that the 
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P/D points are not known. This results in a situation where travel distance between any 
two nodes is stochastic and the resulting problem is one of stochastic single period network 
flow. 

Every single point along the edge of a cell is a feasible P/D point and therefore in order 
to determine the optimal or the best P/D points all the points would have to be investigated. 
This results in a problem with infinite potential points mciking the problem intractable. For 
the purpose of this analysis, we will assume that both the pick-up and drop-off points for 
a given cell are at the same location. This assumption can be relaxed if necessary without 
increasing the complexity of the problem. 

The large continuous problem can be reduced by considering discrete points that repre¬ 
sent all the good solution points that exist. In selecting these scenarios, we must also ensure 
that we do not overlook a “good” or optimal P/D point. A few sample points are selected 
for each cell such that the maximum deviation from an optimal solution is reduced. The 
points selected for any cell are the extreme points, points of intersection with other cells, 
the centre of each side. Although these points may not be representative of the optimal 
solution to the problem, they will ensure that a good solution is obtained quickly. Hence, 
for any cell we only consider these 3 sets of points along each edge. In practice, a P/D 
point is never located at an edge and therefore the P/D points are assumed to be located a 
small distance ‘a’ away from the edge, where a = ((length of P/D station)/2) -f e where 
6 is a small value. This limits the number of scenarios we need to consider to a maximum 
of 17 for each cell. This can be further reduced by using dominance rules. 

The algorithm developed by Chajjed et al (1992) also helps in selecting P/D points and 
a flowpath for bi-directional AGVs. However, they assume a rectilinear distance matrix 
which is valid only in case all the P/D points are situated at extreme points and are of equal 
sizes. In case of a layout with unequal machines and where the P/D points are not situated 
at the extreme points of an edge, then their model can underestimate the actual travel 
distance by as much as the length of the largest side of a cell. For the case of large cells and 
a large number of trips, this underestimation adds up and results in a significantly lower 
travel distance than the actual distance. Kim & Klein (1996) also developed an optimal 
algorithm to develop the P/D points for an AGV system. They prove that the optimal P/D 
point will always lie at the extreme end of a cell. However, their optimal algorithm and 
proof is valid only for the case where all the cells or machines are of equal size. 

The problem of concurrently determining the P/D points and flowpath in case of bi¬ 
directional AGVs can be formulated as a shortest path network flow problem. In case 
of small or medium-sized problems, any existing network flow algorithm such as Floyd’s 
algorithm can be used for finding all shortest pairs (Ahuja et al 1993). In this case it is 
assumed that there exist no arcs between the nodes in the same cell to ensure that no more 
than one node is selected from each cell. 


7. Conclusions 

This paper presents an outline of the trends seen in layout and material-handling flowpath 
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towards more practical ways of tackling these problems and solving them quickly. The 
research focus is becoming more application based and several problems faced in the 
implementation stage are considered in the design stage itself. The manufacturing system 
design is now being considered as a whole instead of splitting it into various sub-problems. 
Although the entire problem being solved at once may not be feasible, a greater effort is 
being made to model the entire system or a set of subsystems. This paper also presents an 
idea for solving the P/D point location and material-handling flowpath problem efficiently 
and quickly. 
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Foreword 

This special issue of Sddhand is a collection of nine invited lectures presented at the 
International Conference on Advances in Mechanical Engineering (ICAME), held at the 
Indian Institute of Science, Bangalore in December 1995, to commemorate the Golden 
Jubilee of the setting up of the Department of Mechanical Engineering. In these fifty years 
the depaitment has played a significant role in the building up of a sound technological base 
in the country through the active interaction of its faculty and alumni with major industries, 
national laboratories and other research and development institutions in India and abroad, 
and has stimulated a continuous quest for challenging new directions in teaching, research 
and development activities. The department appropriately celebrated its golden jubilee by 
organising the ICAME to take stock of recent developments and envision future trends in 
mechanical engineering. When a proposal was made to disseminate the invited ICAME 
talks to a wider audience of scientists and engineers through Sddhand, it met with ready 
encouragement from the Indian Academy of Sciences, and in particular from Professors 
N Viswanadham and J Srinivasan, for which I thank them. 

The nine papers in the present issue represent a broad spectmm of current research 
in mechanical engineering. The first paper by Abhijit Guha considers two-phase flow of 
vapour-liquid mixtures with large numbers of minute droplets. He analyses the formation 
of and the impact of these droplets on the dynamics of this flow. Two-phase flows are present 
in a large number of industrial and natural situations, such as dusty gases, cavitation, steam 
turbines etc. Guha proposes a new theory to explain the nucleation process in a steam 
turbine. 

The second paper by Fazle Hussain and others describes vortex mechanics based on the 
unique characteristics of confined swirling incompressible flows. The development of the 
vortex engine is described in detail. Experiments and analyses performed on a laboratory 
“cold” model of the vortex engine are presented. 

Noncircular exits for gas-jets have been explored as a passive and inexpensive method to 
control the characteristics of fuel gas-jets, thereby controlling the combustion and pollutant 
characteristics. S R GoUahalli’s paper describes experimental studies on noncircular jet 
flames including flow visualisation of the stmcture of the jets. 

In the field of power transmission, compactness of the transmission element is becoming 
increasingly important. While stress and strength considerations have played a dominant 
role in design, controlling the load factor through load distribution/equalisation can be 
a powerful tool to realise compact transmissions. K Lakshminarayana’s paper presents 
the issues important in design for effective load distribution in transmission elements, in 
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particular, spur and helical gear systems. Application of parallel power paths to achieve 
better load distribution is explained. 

The fifth paper in the issue by K N Gupta discusses the analysis of vibration for condition 
monitoring and diagnosis, including basic principles, available instrumentation and various 
vibration monitoring techniques. The paper also presents interesting case studies that use 
vibration signatures for diagnosing failures. 

The paper by Subhash Sinha presents a general technique for the analysis of time- 
periodic nonlinear dynamical systems. In his approach, Sinha uses the Liapunov-Floquet 
transformation to convert quasi-linear periodic systems to linear systems that are time- 
invariant. The paper illustrates stability and bifurcation analyses of the transformed systems 
with two examples. 

The seventh paper by Amitabha Ghosh describes an experimental investigation on the 
non-traditional manufacturing process of electrochemical discharge machining. The author 
studies the process of spark generation and identifies inductance in the circuit as the sig¬ 
nificant process parameter. Two innovative applications of the process for micro-welding 
and rapid-prototyping are also proposed. 

The eighth paper by M Ramulu investigates machining mechanisms and machining- 
induced effects on the structural integrity of FRP materials. The paper presents results 
of experimental studies of orthogonal cutting of both unidirectional and multidirectional 
graphite/epoxy composites. 

The process of removal of material in the machining and surface finishing of brittle 
solids, such as ceramics, is studied by S Chandrasekar and T N Farris in the last paper of the 
issue. They present a model of the material removal process and identify the mechanisms 
responsible for material removal. The effect of these mechanisms on the machined surface 
is also described. 

Though the papers in the issue have not covered all areas of the vast field of mechanical 
engineering, they present a sample of the work going on around the world. I hope the issue 
will be of interest to researchers and practising engineers. 

I would like to end this foreword with an acknowledgement of the untiring efforts of 
Ms Shashikala and other editorial staff of the Academy in bringing out this special issue, an 
expression of sincere thanks to the contributors for their ready cooperation and a personal 
remembrance of one of the contributors. Prof K Lakshminarayana, who is sadly not in our 
midst today to see his lecture in print in Sadhand. 
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T S MRUTHYUNJAYA 
Guest Editor 
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Abstract. Non-equilibrium fluid mechanics and thermodynamics of two- 
phase vapour-droplet and gas-particle flow are considered. Formation of the 
droplets as well as their subsequent interaction with the vapour are discussed. 
A new theory of nucleation in steam turbines is developed that reproduces 
many aspects of measured droplet size spectra which cannot be explained by 
any available steady-flow theories. (Steam turbines are responsible for 80% of 
global electricity production and the presence of moisture significantly reduces 
turbine efficiency costing 50 million pounds per annum in UK alone.) Fluid 
dynamic interactions discussed include flow instabilities induced by conden¬ 
sation, condensation wave theory, relaxation gas dynamics for vapour-droplet 
flow, thermal choking due to non-equilibrium condensation, the structure of 
shock waves and their development through unsteady processes, and jump 
conditions and the interpretation of total pressure in two-phase flows. 

Keywords. Non-equilibrium fluid mechanics; thermodynamics of two-phase 
flows; nucleation in steam turbines. 


1. Introduction 



Two-phase flow of a vapour-liquid mixture consisting of a large number of minute liquid 
droplets uniformly dispersed throughout a background vapour phase continuum is both 
scientifically interesting and of considerable engineering importance (in a variety of ar¬ 
eas in mechanical, chemical and aerospace engineering, and meteorology). In the present 
paper, we discuss the formation of the droplets as well as their impact on the subsequent 
thermo-fluid dynamics of the flow. We restrict ourselves mainly to the description of pure 
substances, i.e., when both phases are of the same chemically pure species. Numerical illus¬ 
trations are given only for steam-water mixtures, but the general principles are applicable 
to other substances as well. 

What follows is a brief description, with no equations, of some of the work with which 
the present author has been involved over the past few years. The references cited (Guha 
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& Young 1989, 1991, 1994; Young & Guha 1991; Guha 1992, 1994, 1997) give a fuller 
treatment of these topics. The VKI lecture series (Guha 1995) treats most of these topics 
at much greater depth. It also contains a good repertoire of references and works of many 
researchers in this field which we do not reproduce here owing to space constraints. 


2. Thermo-fluid dynamics of condensation 

2.1 Physical description of homogeneous condensation 

All condensing (or evaporating) flows are non-equilibrium to a greater or less extent. 
Departures from equilibrium are measured by the subcooling AT which is the difference 
between the saturation temperature at local pressure and the actual vapour temperature 
(AT = Ts — Tg). AT governs the rate at which nuclei are formed as well as the rate at 
which established droplets grow (or evaporate). 

As pure, clean steam expands through a nozzle or a turbine blade passage, droplets 
do not appear as soon as the condition line crosses the saturation line. This is due to 
the existence of a free-energy barrier involved in creating new surface area. For some 
considerable time during expansion the steam remains dry in a metastable equilibrium until 
the subcooling becomes high enough to trigger an appreciable nucleation rate. Depending 
on the rate of expansion and the pressure, steam may become subcooled by 30^0°C while 
still remaining dry. The nucleation process leads to the formation of very large number 
(lO^'^-lO^^ nuclei per kg of steam) of tiny droplets (diameter < 1 nm), called the primary 
fog, more or less uniformly distributed in the continuous vapour phase. Nucleation is 
practically terminated at the point of maximum subcooling called the Wilson Point. For 
pure steam, if the Wilson points for tests with varying nozzle inlet conditions are plotted 
on the equilibrium Mollier diagram, they are contained within a narrow zone around a line 
called the Wilson line (which corresponds to approximately 3-A% equilibrium wetness 
line). 

The droplets thus formed then rapidly grow in size by exchanging heat and mass with 
the surrounding, subcooled vapour (the final droplet radii, r, in laboratory nozzles lie in 
the range 0.02-0.2 pm). The high rate of heat release as a result of rapid condensation, 
causes a sharp increase in vapour temperature and consequently an exponential decay of 
the subcooling. Depending on the values of the flow parameters, the initial growth phase 
of the droplets may give rise to a gradual increase in pressure known as “condensation 
shock”. The term “shock”, however, is a misnomer. Although pressure rises as a result of 
heat addition to supersonic flow, the Mach number downstream of the condensation zone 
usually remains above unity and, more importantly, the rise in pressure is gradual. 

In conventional laboratory nozzle experiments, where (dry saturated or superheated) 
steam is produced in a boiler, the flow must expand to supersonic velocities for significant 
subcooling to develop. However, if subcooled steam could be supplied at the nozzle inlet, 
homogeneous condensation could occur in the subsonic part of the flow. This is possible. 




densation zone (“supercritical condensation shock”). Under certain conditions this shock 
wave may become unstable and propagate towards the nozzle throat. The compressive 
wave ultimately interferes with the nucleation zone causing a reduction in nucleation rate 
and hence heat release rate. With the cause of its inception removed, the strength of the 
wave decreases and the flow again expands through the throat in a shock-free manner thus 
allowing the whole process to repeat itself. Such unsteady flow is normally observed in 
pure steam when the inlet stagnation temperature Tq is close to the saturation temperature 
at the inlet stagnation pressure pQ. Homogeneous condensation then occurs in the tran¬ 
sonic region close to the throat causing flow instability (the flow domain and boundary 
conditions remaining fixed). 

Keeping po fixed, if Tq is progressively reduced from superheated to subcooled levels, 
one encounters different regimes of homogeneous condensation in the order: Subcriti- 
cal condensation (the usual pressure humps characteristic of many condensation experi¬ 
ments), supercritical condensation (with inbuilt shock wave), oscillatory condensation and 
subsonic condensation (Guha 1994a). 

After the “condensation shock”, the steam generally reverts to near thermodynamic 
equilibrium at which the temperature of the vapour as well as of the droplets is close to 
the saturation level. Since the growth of liquid phase takes place by heat transfer through 
a finite temperature difference between the phases, the process is essentially irreversible 
and has associated with it a net rise in entropy. In turbines this appears as a reduction in the 
potential for performing work and is referred to as the thermodynamic wetness loss. This 
is a major component of the overall wetness loss. A simplistic version of an empirical rule, 
formulated by Baumann in 1921, states that the efficiency of a steam turbine decreases by 
1% for every 1% increase in mean wetness fraction. A typical value of the wetness fraction 
at the exit of a steam turbine in an electricity-generating power-plant is 10-12%. Thus, in the 
last stages, the wetness loss is comparable to the combined effects of the profile, secondary 
and tip leakage losses. A 1988 estimate by the then Central Electricity Generating Board 
of UK showed that the adverse wet steam effects cost them 50 million pounds per year. The 
global implication is thus quite serious, since steam turbines are responsible for about 80% 
of the world-wide generation of electricity and there is a considerable economic incentive 
for further research. 

2.2 Numerical solutions for different regimes of condensation 

The numerical scheme for the calculation of steady as well as unsteady non-equilibrium 
wet steam flow has been detailed in Guha & Young (1991) and Guha (1994a). Here, we 
describe only the outline and highlight some important aspects. 

One of the most effective methods of calculation is to write a computational “black-box” 
which contains the nucleation and droplet growth equations, and the energy equation in 
its thermodynamic form. Together they furnish the full set of equations that describe com¬ 
pletely the formation and growth of liquid droplets in a fluid particle (from a Lagrangian 
viewpoint), if the pressure-time variation is specified. The pressure-time variation is ob¬ 
tained by time-marching solutions of the conservation equations such as Denton’s method. 



extensively used for single-phase calculations in turbomachinery blade rows. In this re¬ 
spect, the thermodynamic aspects of phase-change can be completely divorced from fluid 
dynamical considerations so that the use of the “black-box” is effectively independent 
of any particulai’ computational fluid dynamic application. Thus established single-phase 
CFD codes can, rather easily, be modified to deal with non-equilibrium two-phase flow 
with the above-mentioned modular approach. (The flexibility of this scheme may be ap¬ 
preciated from Guha & Young (1994) where the same “black-box” has been grafted into 
a streamline curvature calculation procedure.) 

The development of the computational routines within the “black-box” represents a 
comparatively major undertaking and has been fully described by Guha & Young (1991) 
and Guha (1994a). The routines are sufficiently general and robust to deal with any type of 
nucleating or wet steam flow and (in contrast to many procedures reported in the literature) 
full details of the polydispersed droplet size spectrum following nucleation are retained 
in the calculations. The last aspect is essential for accurate modelling of the nucleation 
zone. This has been possible, without consuming excessive CPU time, by developing a 
novel averaging procedure that constantly redefines the average size and droplet number 
in each droplet group. In this way, the number of droplet groups required is restricted to an 
affordable optimum, while always retaining the correct shape of the droplet size spectrum. 

A mixed Eulerian-Lagrangian technique is used. The continuity and momentum equa¬ 
tions are solved by Denton’s time-marching method. (The wet-steam “black-box” being 
flexible and modular, any other Eulerian time-marching method, e.g. Jameson’s scheme, 
can be used.) The “black-box” performs the integration of the droplet growth equations 
along the fluid path lines rather than the usual, quasi-unsteady, method in which the pres¬ 
sure field remains frozen at a given instant of time while the growth of the liquid phase 
is calculated. The present scheme allows simultaneous solution of all the relevant equa¬ 
tions and enforces the correct coupling between the vapour-phase gasdynamics and the 
relaxation effects due to the presence of the liquid phase. 

For a proper comparison between experiments and theory, variation in pressure as well 
as droplet size must be considered. Many references compare the variation in pressure only. 
Such comparison is an inadequate test for nucleation and droplet growth theories. Almost 
any nucleation theory can be ‘tuned’ to reproduce the measured pressure distribution. 
A crucial test is to find out whether the same, ‘tuning’ can predict the correct droplet 
size as well. Experience with calculations for wet steam points out categorically that, in 
general, predicting a satisfactory pressure distribution does not automatically ensure a good 
prediction of droplet size. The present computational scheme has been validated against 
measurements of steady (both sub- and supercritical) and unsteady condensation shock 
waves (Guha & Young 1991). Various regimes of condensation have been computed by 
Guha (1994a), which shows a novel example of subsonic condensation where the nozzle 
is unchoked at the geometric throat. (In most reported studies on condensation shocks in 
nozzles, condensation takes place in the supersonic divergent part.) 

Figure 1 presents one example of unsteady calculation. The prediction compares well 
with measurement. The pressure profiles at different instants during a complete cycle reveal 
exactly the same sequence of the formation and movement of the shock wave as explained 
earlier. As the aerodynamic shock wave moves upstream towards the throat and interacts 
with the nucleation zone, progressively fewer droplets are nucleated, thus resulting in a 
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Figure 1. Unsteady condensation: Evolution of pressure for one cycle. 


larger final mean radius. This causes a large variation in the droplet sizes during each 
cycle. An interesting implication of the unsteady nucleation process is that it may be a 
contributing factor to the formation of the highly-skewed polydispersed droplet spectrum 
measured in a real steam turbine: a polydispersity which cannot be predicted with steady 
flow calculation methods (see § 4). Details of this and other calculations are in Guha & 
Young (1991) and Guha (1994a). 

2.3 Integral analysis: Condensation wave theory 

A great deal of physics may be learnt from an integral analysis, which is a study of the 
jump conditions relating the end states of the condensation zone, without considering the 
detailed flow structure within it. Figure 2 shows an example calculation of the condensation 
wave theory for air with 1% moisture content. Guha (1994a) presents condensation wave 
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Figure 2. Condensation wave theory. 





theory in great detail and discusses similarities and differences with the more familiar 
combustion wave theory. (The reference considers control volume analysis of two types of 
near-discontinuities, condensation shock as well as aerodynamic shock waves in vapour- 
droplet flow, from a common standpoint.) In figure 2, M/i represents the upstream Mach 
number based on the frozen speed of sound a/ and Me 2 represents the downstream Mach 
number based on the equilibrium speed of sound ae (§ 3.2). The effect of heat release is 
very pronounced when the Mach number lies approximately between 0.8 and 1.1, and a 
small amount of condensation may alter the flow velocity etc., quite dramatically. Details 
may be found in Guha (1994a). 

2.4 Thermal choking due to non-equilibrium condensation 

It is well known that heat addition causes a reduction in Mach number in supersonic flow 
and an increase in Mach number in subsonic flow. In other words, heat addition to a flowing 
fluid drives the Mach number towards unity. Therefore, at a particular flow Mach number, 
the fluid can absorb a maximum quantity of heat before the local Mach number equals 
unity and the flow becomes thermally choked. Any elementary textbook on classical gas 
dynamics gives the expression for critical quantity of heat, ^classical ’ simple heat addition 
to an ideal gas (external heat addition without any change in flow cross-sectional area). 

Similar to the case of external heat addition, the Mach number decreases in the conden¬ 
sation zone (the flow being supersonic). Therefore, for particular combinations of nozzle 
geometry, supply conditions and the working fluid, the liberation of latent heat could be 
such that the minimum Mach number becomes unity and the flow is thermally choked. (A 
numerical computation of this limiting case of thermal choking due to non-equilibrium 
condensation is later shown in figure 4.) If the inlet total temperature, T^i, is reduced any 
further, keeping the inlet total pressure, po\, fixed, continuous variation of the flow vari¬ 
ables is no longer possible and an aerodynamic shock wave appears inside the condensation 
zone. 

Although widely refen-ed, it has been argued in Guha (1994b) that the expression for 
^dassical appropriate for a condensing flow primarily for two reasons: (i) In case 
of a condensation shock, the heat is added as a result of condensation of a part of the 
fluid itself. Therefore, the mass flow rate of the condensable vapour changes as the vapour 
is continually transformed into the liquid phase. The expression for which is 

derived for external heat addition to an ideal gas, does not take into account this mass 
depletion, (ii) The droplets formed through homogeneous nucleation grow at 2 ifinite rate by 
exchanging mass and energy with the surrounding vapour. Therefore, the energy addition 
due to condensation is not instantaneous and takes place over a short but finite zone. Since 
condensation shock normally occurs in the diverging section (with dry vapour at inlet), 
this means the flow area increases between the upstream (Ai) and downstream (A 2 ) of the 
condensation zone. The expression for <?ciassicai’ hand, is derived by assuming 

heat addition in a constant area duct. 

Guha (1994a) gives details about the relative importance of the above two effects. Two 
new expressions for critical quantity of heat have been derived: which deals with 

external heat addition with area variation, and which takes into account the area 

variation as well as the depletion in the mass of vapour due to condensation. The dotted 


Figure 3. Effects of area ratio across 
the condensation zone and the depletion 
in vapour mass flow rate on the critical 
amount of condensation (— - 

^integral^’ 

lines in figure 3 represent ^integral while the solid lines are the plots of ^ J. Note that the solid 
line corresponding to unity area ratio {A 2 /A 1 = 1) is nothing but the classical solution, 
^classical- Figure 3 shows clearly that the effects of mass depletion and (even a small) area 
variation are quite dramatic, especially when the Mach number is close to unity. 

Figure 4 presents a time-marching solution of a limiting condensation shock, giving the 
variation in Mach number (M), non-dimensional pressure (p/poi) and non-dimensional 
wetness fraction {y/yeq) along the nozzle axis. There are two sonic points in the flow field. 
(Achieving exactly a Mach number of unity at the end of the condensation zone (point 2) 
required numerical experiments of varying To\ keeping po 1 fixed.) At the conditions shown 
in figure 4 A 2 /A 1 — 1.0284, Mi = 1.182. The critical amount of heat obtained from this 
full numerical solution of the nozzle flow is denoted by . Table 1 compares the results 
of various integral analyses. 

In conclusion, a theory of thermal choking due to non-equilibrium condensation in 
a nozzle is presented (Guha 1994b). The theory is based on a simple control volume 
approach. (A differential theory of thermal choking is discussed in Guha (1994a).) It 
applies to vapour-droplet flow with or without a carrier gas. The expression for critical 
heat (or condensation) derived is valid for either supersonic or subsonic flow, and for heat 
release either in the diverging or in the converging part of a nozzle. 
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Figure 4. Time-marching solution of a 
limiting condensation shockin aquasi-one- 
dimensional convergent-divergent nozzle. 
{Continuous variation of flow variables 
leading to thermal choking at point 2.) 
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External heat addition to ideal gas 

Condensational heat 
release with area 
variation and mass 

Time-marching solution 
of non-equilibrium 

in constant area duct 
(‘?dassical/^Pg^«l) 

in duct of varying area 
(qycpjol) 

depletion of vapour 
(^integral/'^PS^ol) 

gas dynamic equations 

(^actual/^PS^ol) 

0.01978 

0.0423 

0.0574 

0.0587 


Table 1 shows that the present theory, <?integrai’ agreement with the full 

numerical solution of the differential equations of motion (giving the detailed structure 
of a condensation shock wave leading to thermal choking). The usually quoted <?*iassical 
underestimates the critical heat by a factor of three in the example calculation presented. 
The variation of area across the condensation zone (although small) and the depletion in 
vapour mass as a result of condensation cannot be neglected in determining the critical 
heat in condensing nozzle flow. 


3. Fluid dynamics with interphase transport of mass, momentum and energy in 

pure vapour-droplet mixtures 

3.1 Relaxation gas dynamics for vapour-droplet mixtures 

A lucid description covering many aspects of relaxation gas dynamics and its applications 
to vapour-droplet flows (including coupled relaxation processes) may be found in Guha 
(1995). 

If a property of a medium is perturbed from its equilibrium state and the restoration 
to equilibrium occurs at a finite rate, the medium is called a relaxing medium and the 
process of restoration is termed relaxation. Simple relaxing media follow the archetypal 
rule of the restoration process following a perturbation: d{Aq)/dt = —Aqlz, where, q 
is an internal state variable, Aq is the departure from equilibrium and r is the relaxation 
time. 

A vapour-droplet medium is assumed to be a homogeneous two-phase mixture of a large 
number of fine, spherical droplets dispersed in the continuous vapour phase. Although the 
droplet cloud may exhibit an arbitrary level of polydispersity with a spectrum of different 
sizes of droplets, in the present analysis we will restrict ourselves to a mono-dispersed 
droplet population for simplicity of description. We also consider pure substances only. 
The droplets are large so that the capillary subcooling is negligible. 

A vapour-droplet medium may go out of equilibrium in three different ways as below. 

(i) The droplet temperature is not equal to die saturation temperature (T/ 7 ^ Ts). 
ATi — Ts — Ti is the relevant non-equilibrium variable and r^) is the corresponding 
droplet temperature relaxation time. 

(ii) The two phases have unequal velocities, i.e., there may be a slip between the phases 

^ V/). AV = Vg — Vi is the non-equilibrium variable and r/ is the corresponding 
inertial relaxation time. 
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(iii) The temperature of the vapour phase may be different from the saturation temperature 
(Tg Ts). AT = Ts — Tg is the relevant non-equilibrium variable and tt is the 
corresponding vapour thermal relaxation time. [Note that negative AT means that 
the vapour is superheated.] 

It is clear that under non-equilibrium conditions mass, momentum and energy transfers 
would take place between the two phases. It is through these interphase transfer processes 
that the system returns to equilibrium following a perturbation. The equations representing 
the different inteiphase transfer processes and the expressions for rp, zj and tt have been 
derived in Young & Guha (1991) and Guha (1995). Figure 5 shows the variation of the 
three relaxation times in pure steam {p = 0.5 bar, y = 0.1) as a function of droplet radius. 

It can be shown (Guha 1992c) that, quite generally, the relation zp <$: z/ zp 
holds in pure vapour-droplet flow. Therefore, following a disturbance, on a very short 
time scale the droplet temperature reaches equilibrium, then the velocity slip and finally 
the vapour temperature relaxes to the equilibrium value. While considering any particular 
relaxation process, therefore, relaxation processes with smaller relaxation times may be 
assumed to have been equilibrated whereas relaxation processes with larger time scales 
may be assumed to be frozen. Mathematically, for example, equilibrium droplet temper¬ 
ature means that AT/ and zp simultaneously tend to zero, the ratio ATi/zp remaining 
finite such that ATijzp V/(d7]f/dx), and, for instance,heat transfer implies 

XT -> 00 . 

3.2 Speeds of sound in a pure vapour-droplet mixture 

As a result of three relaxation processes in vapour-droplet flow, four different sound speeds 
(< 2 /, Oei, ae 2 , ae) may be defined subject to different mechanical and thermodynamic 
constraints. The full frozen speed of sound, ay, corresponds to the speed of an harmonic 
acoustic wave of such high frequency that the response of the droplets is negligible (i.e. 
zero mass, momentum and energy transfer). The full equilibrium speed of sound, ae, 
corresponds to the speed of an harmonic acoustic wave of such low frequency that liquid- 
vapour equilibrium is maintained at all times. The two intermediate sneeds r*r»rrecnr»nrl 



Figure 6. Schematic structure of shock 
waves, (a) Partly and (b) fully dispersed 
waves. 

Iicat translcr ), tind (b) the case of equilibrium droplet temperature and velocity slip 
u‘la\aiiun but frozen heat transfer (r/<- 2 )- 

flic derivation and the expressions of the sound speeds may be found in Young & Guha 
t 1 1 1 atid (luhii (1). The relationship of the four sound speeds to each other is of great 

iinpurtancc. As ti typical example, the ratios of different sound speeds for steam at 1 bar 
and i). 1 wetness fraction arc given by, af. a^i: ae 2 ’ «£. = !:0.997:0.945:0.878. 

Stnu'iurr of stationary shock waves 

C )ne can ilehne at least two Mach numbers corresponding to the limiting speeds of sound: 
a tro/en Mach mnnhcr Mf ~ V’/r/y, and an equilibrium Mach number Mg = Y/ag,where 
is the How velocity. In general, r/g < a/ and Mg > Mf. As a result of the frequency 
dispersitm, therefore, two distinct types of shock waves might form in a relaxing medium, 
the far upstream and far downstream conditions being at equilibrium (figure 6 ): (1) A partly 
dispersed shock wave, where a discontinuous jump in flow variables (dominated by viscos¬ 
ity and thermal conductivity as in Rankine-Hugoniot shocks in classical gas dynamics) is 
tollowed by a continuous relaxation zone in which the fluid returns to equilibrium by per¬ 
forming the relaxation processes. Such a case arises if the upstream velocity is greater than 
the frozen .speed of .sound, i.e., if > a/o or, Mf„ > 1. In order to denote the upstream 
condition, we use both subscripts o (so that confusion does not arise with sound speeds 
andt/g 2 )tind 1 (as this is the conventional nomenclature for Rankine-Hugoniot condi¬ 
tions discussed in § 3.5). (2) A fully dispersed shock wave, where a continuous change of 
flow prtiperties takes place from the upstream to the downstream equilibrium state. This 
corre.sponds to the case when ajp < V^o < i-C-. whenM/g, < l,butMgo > 1 . 

With lour limiting speeds of sound in a vapour-droplet mixture, four types of stationary 
shock wave structures may arise (Young & Guha 1991; Guha 1992c, 1995); 

type I waves corresponding to Ueo < Vgo < cieio, 

type II waves corresponding to a^o < 

type in waves corresponding to a^\o < < «/o> 

partly dispersed waves corresponding to Vgo > ^fo- 

Type I, II and III waves are sub-categories of the fully dispersed waves, where the 
steepening effect of the nonlinear terms in the equations of motion is just balanced by the 
dispersive effect of the relaxation processes. Type I waves are dominated by vapour thermal 
relaxation, type II waves by both velocity and vapour thermal relaxation and type HI waves 
by all three relaxation processes. It has been shown (Young & Guha 1991; Guha 1995) 
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that I AT/1 becomes unstable in the interval ae\ < Vg < a/, |AV| becomes unstable in 
the interval ae 2 < Vg < Oei, and, | AT] becomes unstable in the interval ae < Vg < ae 2 - 
These instabilities in the non-equilibrium variables are the reasons for the existence of 
the fully dispersed waves. Figure 7 shows numerical solutions for a typical type II fully 
dispersed wave in a steady flow of pure wet steam. Numerical solutions of other types of 
waves may be found in (Guha & Young 1989; Guha 1992b, 1995). 

The usual model of a partly dispersed shock wave assumes that the interphase transfer 
processes are frozen during the passage through the discontinuity and the vapour properties 
just downstream of the discontinuity can be calculated using a standard Rankine-Hugoniot 
analysis. The liquid droplets therefore pass through the frozen shock without change in ra¬ 
dius, temperature and velocity. The conditions downstream of the discontinuity provide the 
initial values for integrating the two-phase conservation equations through the relaxation 
zones. The droplet temperature relaxes very quickly, followed by velocity slip and finally 
the vapour temperature. The lengths of the relaxation zones are in the approximate ratios 
td'- tj: Tf. Details about partly dispersed waves in vapour-droplet flow may be found in 
(Guha 1992b). Although linearized analyses are often presented for the relaxation zone 
downstream of the frozen shock in standard treatments on relaxation gas dynamics, they 
are of limited applicability to vapour-droplet flows (Guha 1992b). 

3.4 Shock waves in unsteady flow 

The physical significance of the various wave profiles discussed above can be appreciated 
more readily by considering their development under unsteady flow conditions. As a typical 
example, we now discuss wave generation in one-dimensional flow by an instantaneously 
accelerated piston in a frictionless pipe initially containing stationary wet steam. The 
numerical scheme and other details may be found in Guha & Young (1989). 

Figure 8 shows the numerical prediction of a wave propagating in wet steam. The figure 
also includes the flow behaviour in a dry, ideal gas under identical conditions in order to il¬ 
lustrate the special features of a vapour-droplet mixture. The (t-x) diagram was constructed 
from the results of the unsteady time-marching calculation. At the instant of initiation, all 
the interphase transfer processes are frozen and the shock velocity corresponds to the prop¬ 
agation velocity into a single-phase vapour at the same temperature. Behind the shock, 
the mixture relaxes to equilibrium along the particle pathlines. The droplet temperature 
relaxes first on the very short timescale ro and is followed by the velocity slip and vapour 
temperature on timescales r/ and tt respectively. Changes along the particle paths are 
propagated upstream and downstream along the left and right running Mach lines (based 
on the frozen speed of sound). The right running Mach lines overtake the shock wave, 
weakening it and causing it to slow down. The shock path therefore curves in the (t-x) 
diagram until it reaches a constant equilibrium speed. When this occurs, the dispersive 
effects of the relaxation processes are just balanced by the steepening effects of the non¬ 
linear terms and the wave structure is identical to that of the stationary waves in steady 








Figure 8. Numerical prediction of shock propagation in a pipe. 


superheat vapour temperature, droplet radius and wetness fraction is shown by the curves 
in figure 9 which are self-explanatory. As with stationary partly dispersed shock waves, 
the increase in wetness fraction downstream of the frozen shock wave is due to the effects 
of velocity slip. 




3.5 Integral analysis: Jump conditions 

Detailed study on the jump conditions across shock waves has been made by Guha (1992a, 
1994a) and the similarities and differences of condensation discontinuities and aerody¬ 
namic shock waves are discussed at length (Guha 1994a). Here, we can only mention the 
bare minimum. 

In a simple relaxing medium, e.g., a solid-particle-laden gas, the form of the jump 
conditions (R in figure 6) are identical to the Rankine-Hugoniot relations for an ideal gas. 















Figure 9. Variation of flow parameters during shock propagation in wet steam. 


if one uses the equilibrium Mach number, M^, and the equilibrium isentropic index, 
(Expressions for ye in vapour-droplet mixtures with or without a carrier gas are given in 
Guha 1992a, 1994a) However, it has been shown by Guha (1992a) that although jump 
conditions of the same form as Rankine-Hugoniot relations can be formulated for vapour- 
droplet mixtures, they are approximate and hold only conditionally. 

A solid-particle-laden gas can be treated as a modified gas. Equations governing vapour- 
droplet flow are much more complex. An additional complexity (and novelty) occurs due 
to interphase mass transfer (the liquid phase evaporates inside a dispersed wave). If the 
strength of the wave is sufficient, complete evaporation may result, thereby rendering a 
two-phase medium before a shock wave into a single-phase one after it! 

The integral analysis (Guha 1992a) reveals that, depending on the upstream wetness 
fraction and the pressure ratio across the wave, four types of shock structures may result in 
vapour-droplet flow. They are: (I) equilibrium fully dispersed, (H) equilibrium partly dis¬ 
persed, (m) fully dispersed with complete evaporation, (IV) partly dispersed with complete 
evaporation. Figure 10 shows the boundaries of the four regimes in low-pressure wet steam. 

Jump conditions across all types of aerodynamic waves are derived by Guha (1992a, 
1994a), Figure 11 shows the predictions of the integral analysis compared with numer¬ 
ical solutions of the wave profile as discussed in §3,3. The dotted lines in figure 11 
represent the jump conditions. The solid lines represent numerical solutions for three 
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shock structures in low-pressure steam. 

categories of shock waves: (I), partly dispersed with complete evaporation (pi = 0.35 bar, 
M/i = 1.6, yi = 0.05, r] = 0.1 pm); (II) equilibrium partly dispersed (pi = 0.35 bar, 
Mfi = 1.2, yi = 0.05, r\ =0.1 pm); (III) equilibrium fully dispersed (pi = 0.35 bar, 
Myi = 0.97, y\ = 0.1, ri = 0.1 pm). Results from integral analyses agree remarkably 
well with solutions of the differential equations of motion, thereby confirming independent 
theoretical consistency. 

It is interesting to study the various mechanisms of entropy creation inside a dispersed 
shock wave (Guha 1992a). In addition to the effects of viscosity and thermal conduction, 
entropy is created due to the relaxation processes. Figure 12 shows the rise in enffopy across 
shock waves as a function of upstream frozen Mach number. Two cases are considered: 
dry steam, and, wet steam with upstream wetness fraction 0.1. Shock waves may occur in 
wet steam even when M/i < 1, due to the existence of fully dispersed waves. In a partly 
dispersed wave, entropy rises across the frozen shock as well within the relaxation zone 
that follows. The figure shows that the contribution of the relaxation processes is very 
significant in the overall creation of entropy. 

In an ideal gas, the total rise in entropy across a shock wave is fixed by Rankine- 
Hugoniot relations, the magnitudes of viscosity and thermal conductivity merely determine 
the thickness of the shock wave. It has been shown (Guha 1992a) that the total rise in entropy 
across a dispersed shock wave in vapour-droplet mixtures is similarly fixed by the jump 
conditions. In exact analogy with the role of viscosity, various relaxation processes and 
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Figure 11. Comparison of asympto¬ 
tic pressure profiles with generalised 
Rankine-Hugoniot equations. 
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Figure 12. Entropy rise across shock 
waves in low-pressure steam. 

e shock wave. Overall jump conditions in 
any property, including entropy, can be determined without any explicit reference to the 
processes which make the ‘jump’ happen! 

An integral analysis as well provide much insight into the unsteady development of ^' 
shock waves. Details can be found in (Guha 1995; Guha 1992c). 

3.6 Interpretation of total pressure and temperature in two-phase flow 

In this section we discuss briefly some interesting effects of the non-equilibrium, interphase 
transfer mechanisms in a stagnation process in two-phase flow. Pitot measurements are 
often used for inferring velocity or loss (entropy generation) in multiphase mixtures. In 
single-phase fluids, the fluid is assumed to be brought to rest at the mouth of the Pitot 
tube isentropically. Hence flow Mach number and entropy generation (in steady, adiabatic 
flow) are uniquely determined by the total pressure measured by a Pitot tube, together with 
an independent measurement of the static pressure. (In supersonic flow in an ideal gas, 
application of Rankine-Hugoniot equations across the detached shock wave in front of a 
Pitot tube retains the utility of Pitot measurements for deducing flow Mach number and 
entropy generation.) Pitot measurements in a multiphase mixture, however, require careful 
considerations. Similar considerations are also needed for interpreting total temperature. 

Guha (1997a) gives the details of the physical considerations required and the description 
of a unified theory for the interpretation of total pressure as well as total temperature. 

The solid particles or the liquid droplets respond to changes in temperature, velocity 
etc. of the gas phase through interphase exchanges of mass, momentum and energy. These 
ai'e essentially rate processes and hence significant departures from equilibrium can take 
place if the rate of change of external conditions, imposed by the deceleration in the 
stagnating flow, is comparable to the internal time scales. Thus, for example, if the size s 
of the liquid droplets or the solid particles is very small, then inertial and thermodynamic 
equilibrium between the two phases are maintained always, and a Pitot tube would measure I 
the equilibrium total pressure, On the other hand, if the size of the droplets or the 
particles is very large, all interphase transfer processes remain essentially frozen. The 
Pitot tube records the pressure which it would have recorded if the vapour phase alone 
was brought to rest from the same velocity. The total pressure in this case is termed the 
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frozen total pressure, pof. Analytical expressions for poe and pof, both in vapour-droplet 
and gas-particle flow, are given (Guha 1997a). 

As an example, consider low-pressure wet steam with a typical wetness fraction of 10% 
and at a Mach number 1.5. Calculations show that pof jp = 3.3 and Poe!P = 3.79, where 
p is the static pressure. Therefore, in this particular example, the equilibrium total pressure 
is about 15% higher than the frozen total pressure. 

It is expected that for intermediate sizes of the droplets or particles, the pressure recorded 
by the probe would neither be the equilibrium nor the frozen value. The imposed decelera¬ 
tion in front of the Pitot tube would cause the two-phase mixture to deviate from equilibrium 
conditions, both inertially and thermodynamically. The deceleration process consequently 
ceases to be isentropic, as non-equilibrium exchanges of mass, momentum and energy 
between the two phases create entropy. 

Guha (1997a) has considered a large number of two-phase mixtures, both gas-particle 
and vapour-droplet, at subsonic as well as supersonic velocities for many different sizes of 
the droplets (or particles). In the supersonic case a detached frozen shock wave stands in 
front of the Pitot tube. The relaxation mechanisms in a gas-particle mixture are different 
from those in a vapour-droplet flow. Despite all these complexities and differences, it was 
possible with proper non-dimensionalization of flow parameters to adopt a universal plot, 
within acceptable tolerance, of non-dimensional total pressure, Rp, versus Stokes number, 
St (which is a non-dimensional representation of particle size). Rp and St are defined 
by, Rp = {po — Pof)/(.Poe “ Pof) and St = xjVoo/L, where, Voo is the unperturbed 
velocity of the two-phase mixture far upstream of the measuring device, po is the pressure 
attained at the measuring point under non-equilibrium conditions (the total pressure which 
is measured) and L is a characteristic length (in subsonic flow L is related to the Pitot 
diameter, in supersonic flow L is related to the distance between the frozen shock wave 
and the Pitot mouth). Larger droplets or particles correspond to higher St. 

Figure 13 shows the variation of Rp with St, which may be adopted as the Pitot correc¬ 
tion curve usable at a wide range of subsonic and supersonic Mach numbers and for any 
two-phase mixtures (vapour-droplet or gas-particle). The variation is monotonic. It should 
be noted that the denominator in the expression for Rp is calculated using the equilibrium 
thermodynamics, whereas the numerator is calculated using non-equilibrium equations. 
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Figure 13. Near-universal plot of Rp ver¬ 
sus St: comparison of a simple analytical 
theory with numerical solutions. 





Figure 14. Entropy rise versus Stokes 
number in air with different solid particles 

8 = 0.1 (-), 0.8 (-), 1.2 (-), 

4.0 (-). 


That the value of Rp, shown in figure 13, indeed tends to unity and zero in the appro¬ 
priate limits of St, demonstrates independent theoretical consistency of the calculation 
schemes. 

In addition to these numerical calculations, an analytical theory for determining to¬ 
tal pressure under non-equilibrium conditions has been formulated. The analytical the¬ 
ory is simple and is amenable to direct physical interpretation. The theory shows that 

= 1/(1 4- St), The predicted total pressure correctly reduces to the frozen total pressure 
ill the limit of large Stokes number (large particles) and to the equilibrium total pressure 
in the limit of small Stokes number (small particles). Maximum dependence of the total 
pressure on Stokes number is observed when the Stokes number is of the order unity. The 
analytical result is also plotted in figure 13 for comparison. Under non-equilibrium condi¬ 
tions for intermediate St, the prediction of this equation compares very well with results 
from full numerical solution of the gas dynamic equations for two-phase mixtures. 

Figure 14 plots the rise in mixture entropy, as mixtures of air and solid particles are 
decelerated by a measuring probe from their far upstream velocity to rest. Four different 
solid particles (hypothetical) with 5 = cijcpg =0.1, 0.8, 1.2 and 4 are considered and 
the calculations ai'e done for two Mach numbers. For the subsonic case (M/oo = 
figure 14 shows that the rise in entropy is indeed maximum when St ~ 1, and is almost 
zero in the frozen and equilibrium limits. (Recall from figure 13 that the total pressures 
are different in these limits.) At M/oo = 1-5, the entropy rise is again maximum close to 
St ~ 1, but it has a finite value both at St 0 and at St -> oo. The rise in entropy in 
the limit St -> oo is simply that across the frozen shock. (Since the same frozen shock 
is involved in all cases because the same M/oo is used, this increase in entropy is the 
same for all four mixtures considered.) The rise in entropy in the limit St -> 0 is, how¬ 
ever, different for different mixtures (it depends on the isentropic index of the mixture and 
hence on 5), However, it is shown (Guha 1992a) that if the particles come to equilibrium 
downstream of a frozen shock wave, then the entropy rise (across the shock plus the relax¬ 
ation zone) is not dependent on the particle size (and hence on the relaxation times) but is 
determined completely by Rankine-Hugoniot equations for two-phase flow. This fact is 
reflected in the straight, horizontal portions of the curves (at M/oo == 1.5) in figure 14 in the 
limit St 0. 
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The rate of entropy production in a multiphase mixture is maximum when the Stokes 
number is of the order unity (in accordance with other results of relaxation gas dynamics), 
and a reduction in measured total pressure is not unequivocally related to a rise in entropy 
(as it is in steady, adiabatic flow of single-phase fluids).The fact that the total pressure 
decreases monotonically from poe to Pof as St changes form 0 to oo whereas the entropy 
rise is zero at both limits and has a maxima when St ~ 1, demands care while interpreting 
Pitot measurements in multiphase flow. 


4. Application of the non-equilibrium theory to steam turbines 

4.1 The formation of the liquid phase 

An introduction to the flow through steam turbines may be found in Guha (1995). In a 
multistage steam turbine used in power plants for generating electricity, the steam enters the 
low-pressure (LP) turbine cylinders as a dry superheated vapour but becomes wet towards 
the last stages. Experiments show that the water in turbines exists in two quite different 
forms. Usually more than 90% of the mass is concentrated in the fog, which consists of a 
very large number of very fine droplets (diameter 0.05-2.0 pim). The rest is in the form of 
coarse droplets which are very much larger (diameter 20-200 p-m). Coarse water is formed 
as a small proportion of the fog (typically 2-3% per blade row) is deposited on the blade 
surface either due to inertial impaction or through turbulent diffusion. The deposited water 
is drawn towards the trailing edge by the steam flow (or centrifuged towards the casing on 
moving blades), where it is re-entrained in the form of large droplets. The lai’ge droplets 
cause blade erosion, but their thermodynamic and mechanical effects on the steam flow 
can nearly always be neglected. 

Formation of new droplets occurs only over a small part of the expansion in a turbine. 
As a fluid particle flows through the machine, typically the complete nucleation process 
takes only 10-20 p,s, as compared with a typical flow transit time of 5-10 ms through 
an LP turbine. Thus most of the expansion in the turbine simply involves condensation 
on existing droplets. However, nucleation is of crucial importance as it is the process 
which establishes the final fog droplet size distribution which, in turn, determines the 
subsequent departure from equilibrium affecting the flow behaviour, the magnitude of 
the wetness loss and the rate of fog droplet deposition on the blading forming coarse 
water. Once the droplet size distribution can be accurately predicted, the analysis of the 
wet steam flow through the rest of the turbine rests on more solid foundation. However, 
currently no theory exists (see § 4.2 for a novel theory) which gives even remote agreement 
with the available experimental measurements of the size distribution of fog droplets in 
turbines. 

Our inability to understand the nucleation process in steam turbines is surprising given 
the success with which spontaneous condensation in laboratory nozzles and stationary, 
two-dimensional, laboratory cascade of steam turbine blades can be predicted using a 
synthesis of the classical theories of homogeneous nucleation and droplet growth with 
the conservation equations of gas dynamics. Such calculations have now been refined to 
the extent that the theory, amended by only a modicum of empiricism, gives acceptable 



Figure 15. IVpical droplet size distribu¬ 
tions in a supersonic condensing steam noz¬ 
zle (computed) and in the exhaust of a 
low pressure, electricity generating steam 
turbine (inferred from optical transmission 

data). • nozzle; o turbine ;-Sauter 

mean;-mass mean. (Turbine measure¬ 

ments courtesy of PT Walters, National 
Power Technology and Environment Cen¬ 
tre, Leatherhead, UK.) 

agreement in terms of pressure distribution and mean droplet diameter with most experi¬ 
ments reported in the literature. 

As shown in figure 15, calculations of droplet size spectra in condensing steam nozzles 
usually indicate a narrow distribution with a comparatively small mean droplet diameter 
strongly dependent on the local expansion rate near the Wilson point. Because of the 
narrowness of the distribution, optical experimental techniques based on measuring the 
attenuation of light of different wavelengths are unable to resolve the details of the spectrum 
but register results consistent with a near monodispersed droplet population. In turbines, 
however, experimental determinations of droplet size spectra give quite different results, 
the optical characteristics of the medium invariably indicating a broad, strongly skewed, 
distribution with a much larger mean diameter, typically in the range 0.2-0.6 (xm. Quite 
often, the distribution is bimodal with a significant proportion of the total mass of liquid 
contained in a secondary population of droplets having diameters in the range 0.4-1.0 [xm. 
A typical distribution, inferred from fight attenuation measurements in a low-pressure 
steam turbine used for electricity generation, is also shown in figure 15. 

Early attempts to describe the process of phase-transition in steam turbines followed 
the work of Gyarmathy who modelled the turbine by a series of one-dimensional nozzles. 
Reversion to equilibrium from the supersaturated vapour state was predicted to occur at a 
well-defined Wilson point in a particular blade passage giving rise to a near-monodispersed 
population of usually rather small droplets. The nucleation zone was thought to extend 
over a very short distance in the flow direction and calculations indicated that the mean 
droplet size should be very sensitive to turbine inlet conditions, small changes of which 
had the potential for displacing the Wilson point to new locations of quite different expan¬ 
sion rate. Later calculations of nucleation and droplet growth in two-dimensional turbine 
cascades displayed characteristics essentially similar to those found in one-dimensional 
condensation studies, albeit in more complicated and realistic fluid flow fields. 

As noted above, optical measurements of wetness fraction and droplet spectra in tur¬ 
bines tell quite a different story. The measured spectrum is always broader, and the mean 
diameter larger, than conventional calculations indicate. Furthermore, the spectrum is com¬ 
paratively insensitive to small changes in turbine inlet conditions: Measurements taken on 
the same machine over a period of years show excellent reproducibility. Finally, it appears 




laboratoi7 nozzles but generally occupies much greater distances in the now direction. This 
behaviour can be deduced from optical measurements which show that the wetness frac¬ 
tions at certain locations in some turbines are considerably lower than the local equilibrium 
values. 

It is therefore evident that the nucleation of water droplets in turbines involves phenom¬ 
ena which are not reproduced by laboratory experiments on nozzles and stationary cascades 
but nevertheless play a dominating role in the process of phase-transition in real machines. 
Possible explanations include nucleation in blade-wake vortices, heterogeneous conden¬ 
sation due to the effects of impurities in the steam and the effects of blade-row interaction 
unsteadiness. In § 4.2, consideration is given to the third of these possibilities, namely the 
effect of the unsteadiness due to the interaction of blade-wakes with downstream blade 
rows. It will be shown that a direct result of these interactions is to dramatically broaden 
the droplet size distribution giving a general shape and mean diameter much more in keep¬ 
ing with experimental measurements in real turbines. Furthennore, the theory predicts 
that the formation of the liquid-phase takes place in an essentially unsteady manner and 
may encompass a region in the flow comprising one or more complete turbine stages in a 
multi-stage machine. The theory presents a radically different perspective of nucleation in 
turbines from the generally accepted view and, if correct, should have a major influence 
on the future development of calculation procedures for non-equilibrium steam flows in 
turbines. 

4.2 Effects of unsteadiness on the homogeneous nucleation of water droplets in multi¬ 
stage steam turbines 

The details of the theory are given by Guha & Young (1994) and Guha (1995). The essence 
of the theoiy is that large-scale temperature fluctuations caused by the segmentation of 
blade wakes by successive blade rows have a dominating influence on nucleation and 
droplet growth in turbines. The fundamental premise is that, in passing through a multi¬ 
stage turbine, different fluid particles undergo different fluid mechanical experiences de¬ 
pending on the exact details of their passage through the machine and hence arrive at a 
given axial location with a wide variety of thermodynamic conditions. It is further assumed 
that, downstream of any turbine stage, the pressure of all the fluid particles would be near¬ 
uniform but their specific entropies (and hence static temperatures) would vary greatly 
depending on the dissipation experienced by a particular particle due to its being entrained 
in one or more blade boundary layers or lossy regions of the flow. However, although the 
path taken by a fluid particle is assumed to be random, the time-averaged dissipation of all 
the particles should agree with the overall loss distribution in the turbine. This is assumed 
known, either by direct measurement or from empirical loss correlations. 

Figure 16 is a schematic diagram of the way in which the wakes from one blade row 
interact with, and are segmented by, the following row. It can be clearly seen that dissipation 
occurring in successive blade rows can become superposed in certain fluid particles (tlie 
darkly shaded areas). 

A Lagrangian frame of reference is adopted and attention is focussed on a large num¬ 
ber of individual fluid particles during their passage through the turbine. Homogeneous 
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Figure 16. Schematic diagram of the pas¬ 
sage of blade wakes through successive 
blade rows. 
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nucleation and growth of droplets in each fluid particle is assumed to be governed by clas¬ 
sical theories. All fluid particles are assumed to experience the same pressure variation but 
those particles passing close to the blade surfaces suffer greater entropy production and 
therefore have higher static temperatures than those which pursue near-isentropic paths 
through the central portions of the blade passages. Particles which suffer high loss therefore 
nucleate later in the turbine than those which experience little dissipation. Condensation is 
thus viewed as an essentially random and unsteady phenomenon as the dissipation experi¬ 
enced by a fluid particle in one blade-row is assumed to be uncorrelated with its previous 
history. On a time-averaged basis, the condensation zone is spread over a much greater 
distance in the flow direction than a simple steady-flow analysis would indicate and may 
encompass several blade-rows depending on the number of stages in the machine. Predicted 
droplet size spectra show broad, highly-skewed distributions with large mean diameters 
and sometimes slight bimodality. These are all characteristics of experimentally measured 
spectra in real turbines. Conventional, steady-flow calculation methods, which predict a 
fixed Wilson point in a specific blade-row and a near-monodispersed droplet population, 
cannot reproduce any of these characteristics. 

As shown in figure 17, a “loss profile” is constructed to represent the pitchwise distribu¬ 
tion (from the suction to the pressure surface) of the loss in the blade-row. The pitchwise 
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loss profiles represent the time-averaged entropy increase along particular pathlines but 
individual fluid particles associated with the passage of wakes may exit from the blade-row 
at different conditions because, on entry, their static temperatures and velocities deviate 
from the mean. 

For the resnlts presented below, a single (circumferentially averaged) pressure-time 
variation based on an axisymmetric streamline-curvature throughflow solution was adopted 
for all fluid particles. Fluid particles are then launched at the turbine inlet (where the 
steam is superheated), all at the same stagnation temperature and pressure. At the en¬ 
try to each blade row a random number is generated that specifies the pathline to be 
followed by the particle. The pathline, in turn, fixes the value of the polytropic effi¬ 
ciency (figure 17). The “black-box” (§2.2) is then applied along the pathline to cal¬ 
culate the suhcooling and the droplet size distribution (if nucleation has taken place) 
at the downstream of the particular blade row. A new random number is then gener¬ 
ated that specifies the pathline in the next blade row and the procedure is repeated. 
The “black-bojc” can deal with successive nucleations after the primaiy as a matter of 
course should the expansion be sufficiently rapid to generate the high levels of subcooling 
required. 

For each fluid paiticle, the subcooling and droplet size distribution at all points of 
interest are recorded. It is then a straightforward matter to compute the time-mean wetness 
fraction and other statistical properties in order to obtain a quantitative picture of the 
process of phase-change and liquid growth throughout the machine. In a six-stage low- 
pressure turbine, some 10"^ fluid particle calculations are undertaken on each streamsurface 
to obtain converged statistical properties. 

It has been assumed that the classical theories of homogeneous nucleation and droplet 
growth realistically describe the process of phase-change for individual fluid particles. In 
other words, if the pressure-time and temperature-time variations of a fluid particle during 
its passage through the turbine can be accurately specified, then it is assumed that the 
theories of nucleation and droplet growth correctly describe the rate of formation of the 
liquid-phase within that particle. For different fluid particles, phase-change will therefore 
be initiated at different locations in the machine, depending on the dissipation suffered by 
the particular particle. This is in contrast to the usual model of nucleation in turbines which 
assumes phase-change to be governed by the time-averaged fluid properties and therefore 
to occur at a single, well-defined location. 

As an example calculation, the flow through the low pressure stages of a 320 MW turbine 
was analysed- The turbine was manufactured by the Italian company “Ansaldo” and the 
complete geonmetry is available in the literature (Guha & Young 1994; Guha 1995). The 
LP section has six-stages. Each stage consists of a stator followed by a rotor and hence 
there are twelve blade-rows altogether in the turbine. 

The physical characteristics are best explained by adopting a Lagrangian viewpoint of a 
fluid particle as it passes through the turbine. As described in the previous sections, different 
fluid particles experience different amounts of dissipation and heat transfer, depending on 
the particular pathline followed. Two limiting cases can be identified. At one extreme 
are the fluid particles which always follow the mid-pitch pathline in each blade-row and 
consequently suffer no dissipation. They pursue an isentropic path to the Wilson point. At 
the other extreme are those particles which negotiate the regions of maximum loss in each 
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Figure 18. Axial variation in the Ansaldo turbine of vapour subcooling for two fluid 
'particles representing the extreme cases of zero dissipation (- - -) and maximum 
dissipation (—). 


blade-row. Other fliuid particles experience levels of loss intermediate between these two 
extremes. 

Figure 18 shows the calculated variation of the vapour subcooling A T associated with the 
two extreme cases of zero and maximum dissipation. The subcooling of the fluid particles 
change as a result of the competing effects of the three physical processes: expansion, 
condensation and dissipation. Consider first the case of the fluid particle suffering no 
dissipation. Here, one of the mechanisms for altering the subcooling, i.e., dissipation, is 
absent. The fluid particle is superheated (negative AT) at the turbine inlet. Its subcooling 
increases in each blade-row due to expansion but remains almost constant between the 
rows. It attains the Wilson point in the stator of the fifth stage and subsequently experiences 
an exponential decrease in AT due to the extremely rapid liberation of latent heat. AT 
increases significantly again in the last rotor where the expansion rate is too high to be 
offset by the counteracting effect of condensation. Much the same history is repeated 
for the fluid particle experiencing the maximum dissipation. Here, however, dissipation 
opposes the increase of the subcooling throughout the flow field. Consequently, the Wilson 
point occurs much further downstream (in the rotor of the last stage). Other fluid particles, 
experiencing intermediate amounts of dissipation, attain their Wilson points at intermedi ate 
locations between the two extremes. The region of nucleation thus covers (in a randomly 
unsteady maimer) almost two complete turbine stages as opposed to being restricted to a 
very narrow zone in a specific blade-row. 

A reliable “rule of thumb” is that Wilson points occurring at locations of higher expan¬ 
sion rate result in smaller droplets. The variable location of the Wilson point therefore 
results in large variations in mean droplet diameter. It should be understood, however, 
that the mean droplet diameters in the two limiting cases do not necessarily represent the 
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Figure 19. Computed Sauter mean droplet diameter at Ansaldo turbine outlet as a function 
of the axial location of the Wilson point for all fluid particles. 


extreme limits of droplet size produced in the machine. The droplet size is dependent on 
the local expansion rate which does not vary monotonically with distance between the 
extreme locations of the Wilson points. 

Figure 19 shows the location of the Wilson points for the totality of particles considered 
(10“^). Each point on this diagram corresponds to an individual fluid particle. The abscissa 
denotes the axial location of the Wilson point and the ordinate denotes the mean diameter of 
droplets within the fluid particle on its arrival at the turbine outlet. (It should be appreciated 
that each fluid particle is actually composed of a polydispersed distribution of droplet sizes, 
details of which, although retained by the computational procedure, cannot be included in a 
single diagram such as figure 19.) It can be seen that the majority of fluid particles nucleate 
either in the fifth-stage rotor or in the sixth-stage stator. (The absence of Wilson points in 
the first part of the sixth-stage stator results from the very low expansion rate there, see 
(Guha & Young 1994; Guha 1995).) In each blade-row, the mean diameter of the droplets 
becomes progressively smaller as the Wilson point moves towards the trailing-edge, as 
the rate of expansion tends to increase monotonically to each blade throat. However, 
interesting behaviour results for those fluid particles that reach the trailing-edge plane with 
subcoolings and nucleation rates which, although moderately high, are still insufficient to 
cause complete reversion to equilibrium. For these particles, the trailing-edge marks the 
Wilson point (i.e., the cessation of nucleation) even if comparatively few droplets have 
yet been produced. Reversion to equilibrium then occurs within the following gap by 
condensational growth on existing droplets. Because there is plenty of time available 
and because the droplet number density is low, these droplets may grow to very large 
sizes as shown in figure 19. Such fluid particles are then prime candidates for secondary 
nucleations in succeeding blade-rows because their liquid surface area is insufficient to 
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Figure 20. Computed time-averaged 
droplet size distribution at outlet of the 

Ansaldo turbine.-mass mean;- 

Sauter mean. 

offset (by condensation) the opposing effect of increased subcooling due to the rapid 
expansion. 

An imaginary probe with unlimited resolution in space and time, sited at the turbine - 
outlet, would register the complete droplet size distribution for each fluid particle it en¬ 
counters. Real probes based on the measurement of attenuated or scattered light, however, 
record only sufficient information to deduce, at most, the time-averaged droplet size distri¬ 
bution (and sometimes only the time-averaged Sauter mean diameter). In order to compare 
the theoretical predictions with such measurements, a theoretical time-averaged droplet 
size distiibution may be constructed at any axial location in the turbine if the diameters 
and number density of droplets in all the 10'^ fluid particles considered are recorded by the 
computer for subsequent processing. The calculated time-averaged droplet size distribu¬ 
tion for the Ansaldo turbine at outlet is shown in figure 20. The spectrum is polydispersed 
and highly-skewed (i.e., there is a large difference between the mean and most probable 
diameters) and resembles the shape of similar spectra measured in real turbines (figure 15). 
This is very significant, as no existing steady-flow calculation procedure can predict such 
a high degree of poly dispersion. 

Unfortunately, no measurement of the droplet size distribution is available for the 
Ansaldo turbine, although the time-averaged Sauter mean diameter of the droplets has 
been measured. The measured Sauter mean diameter at mid-span is about 0.4 p,m, which 
is a little smaller than the calculated value of 0.55 |xm, shown in figure 20. However, 
allowing for the uncertainties and approximations in the calculation scheme, the level of 
agreement is extremely encouraging. Of course, many more experimental comparisons are 
required before it is possible to assert conclusively that the important physical processes 
are being successfully modelled by the theory presented. 

In conclusion to this section, a theory has been developed for predicting the effect of 
temperature fluctuations on the homogeneous nucleation and growth of water droplets in 
multi-stage steam turbines. The fluctuations result from the segmentation of blade-wakes 
by successive blade-rows and the amphtude of the fluctuations increases with the number 
of stages. According to the model, the mechanics of nucleation in multi-stage turbines are 
quite different from the predictions of conventional steady-state theories of phase-change. 
For example, the nucleation zone may encompass (in a randomly unsteady manner) several 
blade-rows (as opposed to being isolated at a particular position in a specific blade-row). 
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The inherent unsteadiness of the process also results in a highly-skewed, polydispersed 
(sometimes bimodal) time-averaged droplet size distribution, having similar characteristics 
to spectra measured in real turbines. The next step would be to include, in the calculation 
scheme, the effects of circumferential variation in pressure within the blade passages. More 
details may be found in Guha & Young (1994) and Guha (1995). 

The well-known books on multi-phase flows (e.g. by N A Fuchs and by G B Wallis) 
describe the long, inexhaustible list of multi-phase phenomena demonstrating their all- 
pervasive occurrence. Here, we have considered only a few important topics, the selection 
inevitably being biased by the author’s own interests. (See Guha 1997b for a new theory 
for particle transport in turbulent flow.) We have used a complementary combination of 
analytical and computational techniques, and differential and integral treatments in order 
to model fundamental processes occurring in two-phase mixtures as well as to explain 
observed phenomena and experimental findings. This is an exciting, rewarding and potent 
field - so many interesting and important things remain to be done! 
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The vortex liquid piston engine and some other vortex 
technologies 
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Abstract. By exploiting three unique characteristics of confined swirling 
incompressible flows - centrifugal acceleration, internal separation or recir¬ 
culation zones near the axis, and bistability (i.e. rarefied and condensed stable 
states) of multi-phase flows - we developed several innovative vortex machines 
which will revolutionize mechanical technologies in a variety of industries. The 
machines utilizing these features include: Vortex Engine, Vortex Thruster, Vor¬ 
tex Suction Device, Vortex Chemical Reactor, Bubbling Centrifuge and Vortex 
Mill As a specific example, we describe here in some detail the development 
of a liquid piston engine, including analysis of its hydrodynamic and thermo¬ 
dynamic features. We have designed a laboratory ‘cold’ model and performed 
detailed experimental, theoretical and numerical analyses to study the role of 
the controlling parameters and are now ready to test a ‘hot’ model. In addition, 
we mention a few other vortex technologies of interest to us. 

Keywords. Multi-phase flows; incompressible flows; vortex machines; vor¬ 
tex liquid piston engine; swirling flows. 


1. Introduction: Rotating single- and multi-phase flows 

While the proposed applications are innovative, rotating flows within vortex chambers have 
been investigated extensively. As shown schematically in figure 1, the tangential entry of 
fluid into a typical cylindrical vortex chamber, via the guiding vanes around the periphery, 
causes swirling motion of the fluid (mixture). In the core region of the flow away from 
the walls, the radial pressure gradient balances the centrifugal force; this is the so-called 
cyclostrophic balance. Near the end walls, the centrifugal acceleration is very small due 
to the boundary layer effect but the radial pressure gradient is essentially the same as in 
the core region. Thus, the lack of cyclostrophic balance causes the pressure gradient to 


A list of symbols used appears at the end of the paper. 



} 



Figure 1. Schematic of various flow re¬ 
gions in an axisymmetric vortex chamber. 


generate a secondary flow towards the axis along the end walls, balanced by the frictional 
force. This leads to an end-wall boundary layer with a substantial inflow radial speed Vf, 
the maximum of (and of axial and azimuthal velocities) occurring inside the boundary 
layer. This inflow of the rotating fluid along the end wall prevents the typically desirable 
prolonged retention of small particles within the chamber, because they are carried away 
by the fluid flow along the axis. However, by appropriately profiling the end walls (we have 
developed an equation for the end wall profile using hydrodynamic analysis and laboratory 
tests), small particles can be retained within the chamber for longer durations. At practical 
operating speeds, the flow in the near-axis region is extremely complex, involving very high 
turbulence and noise-producing axial oscillations. A detailed study of the flow patterns in 



Figure 2. Condensed stable state of a mixture of liquid-solid particle in a vortex chamber. 
Note the dense, homogeneous particle layer near the periphery, forming a rotating axisym¬ 
metric fluidized bed. 
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The 



this and other regions of the vortex chamber for various operating parameters is necessary 
to develop more efficient designs. 

Experimental and theoretical studies (Batchelor 1967; Greenspan 1969; Goldshtik 1981, 
1984; Gupta et cil 1984; Hussain 1986) show that rotating flows possess three fundamental 
features, described below. 


High centrifugal force-. Typical inlet gas velocities 100 m/s) into a chamber of radius 
0.1 m produce a centrifugal acceleration Uc ^ 10^ g. This high centrifugal force is 
central to all applications discussed here, and ensures stability of the central cavity and 
motion of solid, particles/fluid bubbles in circular trajectories inside the cylindrical vortex 
chamber. 


Near-axis flow: Near the axis of rotation, if the speed is high enough, a gaseous cavity is 
formed when a liquid is used as the working fluid, and a recirculation zone (such as vortex 
breakdown bubble or internal separation, i.e. separation away from any wall), forms if the 
fluid is a gas. The complex flow pattern in this region depends on the operating conditions 
(such as Reynolds number and Rossby number) and the chamber end-wall profile, and 
considerably influences the flow in the rest of the chamber. A clear understanding of the 
flow in this region based on rigorous hydrodynamic analysis is essential for optimal design 
of these machines, particularly the vortex engine and vortex thmster. 





326 


M Goldshtik et al 



Figure 4. Rarefied stable state with large 
particles, which move in polygonal paths 
and collide with the outer wall, forming the 
basis for a vortex mill. 


Condensed stable state: At high particle concentrations, the denser medium (solid or 
liquid) forms a tightly packed layer near the periphery of the vortex chamber and provides 
a very large interface area between the two media; figure 2 illustrates a condensed particle 
layer. 


Rarefied stable state: At lower concentrations, the particles are distributed homoge¬ 
neously and move in circular orbits in the lighter fluid medium (figures 3, 4). The radii of 
the orbits depend on the particle size. For solids, larger particles collide with the chamber 
walls and are ground to smaller sizes (figure 4); this leads to the proposed vortex mill 
(see §8). 

In this paper we focus entirely on the hquid piston engine; the other machines will be 
considered very briefly at the end. 


2. Vortex engine 

Conventional piston engines have relatively complicated designs and undergo greater wear 
and tear due to moving mechanical parts. An external combustion, flat-top liquid piston 
Stirling engine (West 1983) is very simple and does not employ complicated mechanical 
parts. The main drawback of such an engine is the instability of the liquid piston’s top 
surface. When the piston is near its top dead centre, its speed becomes zero, but acceleration 

is mavimnm TTnH(=>r tbpsp r‘nnHitinnc a ctuKilitv r'ritprir»n ic that tVi/i r\-f rrvoxrifoti/M-iol 
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Figure 5. Working principle of the liquid piston engine (see Goldshtik 1992); 1,2- oscil¬ 
lating cavities; 3,4- check valves; 5,6- hydromotors. 


acceleration to fluid acceleration must be greater than unity so that no liquid will leave 
the surface of the liquid piston. Therefore, the frequency of the engine must be less than 
one hertz in order to avoid the instability of the flat liquid piston surface. Accordingly, the 
efficiency of this engine is very low, about one percent, and the power of this engine is 
only several watts at best. 

These limitations of the flat top liquid piston engine are overcome by employing a 
rotating liquid piston (Goldshtik 1992), where centrifugal acceleration is used instead 
of gravity. This is an internal combustion engine with two circular cylinders (i.e. vor¬ 
tex chambers) that are partially filled with a fixed volume of liquid and are connected 
tangentially by two channels containing hydromotors (see figure 5). Each of these cir¬ 
cular cylinders has a top and a bottom cover, a system for intake of fuel and air, and 
an associated exhaust system. Each cylinder may have either an electric spark plug or 
may work in diesel mode (via a fuel spray injector). By using tangential entry, the liq¬ 
uid rotates at high speed within the cylinder and creates a vertical cylindrical gas cavity 
around the axis of the rotating liquid. Rotation is used to stabilize the liquid-gas in¬ 
terface of the cylindrical cavity, which functions like the piston top in a conventional 
internal combustion engine compressing the fuel-air mixture. This cavity is the combus¬ 
tion chamber into which the fuel-air mixture is injected. When the mixture is ignited, 
the increased pressure in the cavity forces some of the liquid out through a tangential 
channel, through a hydromotor, and then tangentially into the second cylinder wherein 
it keeps its swirling motion; this sequence is repeated in the second cylinder and the 
liquid then flows back into the first cylinder. In this manner, as a result of the pressure 
from combustion, the liquid is transferred back and forth between the two cylinders 
at a frequency that can be controlled by changing the system parameters. The unidi- 
rectionally rotating hydromotors extract energy for mechanical drive. In summary, this 
device works without any moving mechanical part, with the exception of the hydro¬ 
motors and valves to control flow timing of liquid, fuei/air mixture, and products of 
combustion. 

In this paper, we focus on the detailed theoretical description of the engine, and also 
present some preliminary numerical and experimental results. Results of the engine’s nu¬ 
merical optimization and experimental investigations will be published in the 
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3. Simple theoretical model of vortex liquid piston engine (VLPE) 

3.1 Principal dynamical equation 

The model assumes an unsteady flow of a liquid in two identical interconnected cylinders, 
with near-axis cylindrical cavities filled with an ideal gas and having time-dependent radii 
r\(t) and r 2 (t). The flow between the cylinders generates a self-sustained oscillation due 
to combustion-induced thermal expansion and contraction processes running in opposite 
phases. As a first step, viscosity is neglected; viscous effects are considered later in both 
theory and numerical simulations. 

For axisymmetric flow, we have the following relationships for radial and tangential 
velocities: 

Vr = Qtr and = F/r, (1) 

where Q = Q(t) and F = const, are related to the dimensional physical volume flow rate 
Qv and circulation F/? by 

Qy=27uhQ and F/?=27rF, (2) 

where h is the chamber height. Conservation of the fluid volume gives the relation 

rf-l-r| = 2o-/?^ (3) 

where R is the cylinder radius and a is the total gas/cyUnder volume ratio (1 - cr is the 
total volume fraction of liquid). The radial velocity of the liquid-gas surface is Vr = dr/dt. 
From (1), we have 


Q = r(dr/dt). 

(4) 

With the help of (3) and (4) we can obtain 


Qi = -Q2 = Q. 

(5) 

For an incompressible, inviscid liquid in unsteady, axisymmetric flow, we can use Euler’s 
equation in cyUndrical coordinates; 

dVr dVr 1 dp 

dt ^ ^ dr r p dr' 

dVcp dV(p VrVcp ^ 

^ + , =0. 

1 d{rvr) _ ^ 
r dr 

(6) 

The last two equations (6) are satisfied by (1) if F is constant. Substituting (1) into the 
first equation (6), and integrating from r to R, we obtain 

dQ R e^ + rVl 

dr r- 2 W B?) p ' 

(7) 
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where pr is the gas pressure in the cavity and pR = p{R) is the cylinder wall 
pressure. 

After writing (7) for both cylinders, subtracting and taking into account (5), we get 


_ 1 _ ^^1 ~ PR2 ^ Pr\ - Pr2 

df rir2 V\ ^2/ P P 


( 8 ) 


The pressures p/?, and pR^ not equal; their difference is the available head for hy¬ 
dromotors. We characterize it approximately by the standard load coefficient which is 
assumed to be a constant, related to the pressure drop Ap across the hydromotor by 

Ap = {^(plR^)Q\Q\. (9) 


Now we choose the chamber radius i? as a length scale, the tangential velocity at the cham¬ 
ber entrance V — V(p{R) (so that F = V) as the velocity scale, and nondimensionalize 
the variables. The main nondimensional variable y{t) is introduced by 

rl = R^(c + y), ri = R\a~y), - ( 10 ) 


where y can be considered as a volumetric deviation from the equilibrium state (y = 0). 
Note that, because of (10), (3) is satisfied identically. The nondimensional time r is 


r = 




^dr' 


( 11 ) 


From (4) and (11) 

Q = VRy, and dQ/dt = 2V^y, 


( 12 ) 


where the dot means differentiation with respect to r. Substituting (9)-(12) into (8), we 
obtain 

yin 2 + f |y|y = n(/i -/ 2 ), (13) 

orz _ yz (j^ - y^ 2 

where f\ - fi = (Pri — Pr 2 )IPt)^ n = PqIp^^, and po is the pressure scale, e.g. 
minimum pressure in the thermodynamical cycle used. Equation (13) is the principal 
dynamical equation for VLPE. It is similar to Rayleigh’s well-known equation for bubble 
dynamics, but in contrast, (13) has no solutions for unbounded fluid; it does only in bounded 
cylindrical domain. Equation (13) is no't a closed system because its right hand side is still 
unknown. Thermodynamic analysis is necessary to determine it. 


3.2 Thermodynamics 


From the first law of thermodynamics for ideal gases, 

8q = CydT + pdV, (14) 

where <5^ is the heat supply, Cy is the specific heat at a constant volume, and T and V are 
temperature and volume respectively. Also for an ideal gas. 


pV = RT, 


(15) 





a) 


b) 

Figure 6. Otto cycle and time-process. 


where R is the universal gas constant, i.e. R = Cp — Cy = Cv(k — 1) and k = Cp/cy is the 
adiabatic exponent. 

Equation (13) has relative volume y and pressure / as dependent variables. Therefore, it 
is convenient to choose gas volume V = nr\}i = 7r/?^/t((T+y) as the basic thermodynamic 
variable. Removing T from (14) by using (15) yields 

hq = (Vdp 4- kpd.V)l{k - 1). (16) 


Assuming that the amount of heat hq is given, we can write a closed system equation from 
(14)-(16), in terms of entropy 5 (using = Td5 for a reversible process), 


Po 



(17) 


where Vq is the volume at the pressure p = pQ and Sq is taken to be zero for 
convenience. 

For a cyclic working engine, the function S{t) must be a periodic function of time. 
The specific form of S{t) depends on the way heat is supplied, i.e. on the thermody¬ 
namic cycle. Most conventional engines operate on the Otto cycle, where heat addition 
begins during the phase when the piston is near its top dead centre (TDC). At this mo¬ 
ment, the volume is near the minimum and combustion occurs very rapidly. In our vortex 
engine, this phase corresponds to the minimum cavity radius when the radial velocity 
is zero. 

Let us consider, for example, an ideal Otto cycle, consisting of two adiabatic and two 
isochoric processes (figure 6a). Let this process begin from position a with the maximum 
volume, when y = yp- The adiabatic compression takes place from a to b with 5 = 0. 
Then at the minimum volume (position b), the system suddenly absorbs heat and goes to 
position c. The entropy increases by a jump, after which there is an adiabatic expansion 
from c to d and then exhaust from d to a. The corresponding time-process is shown in 
figure 6b (the curve denotes the periodic condition (21), and the other cylinder is 90° 
out-of-phase with this one). 

For the Otto cycle the value /i — /2 from (13) can be found as 


( ^ + yo 

Vcr + y/ \cr ~y 



/l - /2 = 


(18) 
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where S = const, during the adiabatic half-period. Note that a and vo determine the 
compression ratio e 

^ = {a + yQ)/{a ~ yo). (19) 

The Otto cycle is chosen as an example only; real cycles are smoother. Equation (13), which 
governs the dynamics of our vortex liquid piston engine, has three parameters: cr (the total 
gas/cylinder volume ratio), ^ (the load coefficient), and fl (the pressure parameter). Two 
additional parameters, S (the heat parameter) and yo (the magnitude of oscillations), are 
contained in (18). Thus the system is characterized by five independent parameters, which 
ai'e large for a complete parametric analysis. Some physical conditions will be used to 
reduce the number of independent parameters. 

3.3 Physical conditions 

Periodicity condition: Equations (13) and (18) form a closed autonomous system with 
the following initial conditions 

}'(0) = 3^0 and y(0) = 0, (20) 

where the initial time r = 0 is chosen at maximum yii) = yo- The value yo in (18) is 
not arbitrary but has to be determined from the periodic condition. We assume that in both 
cylinders, the processes are identical but only shifted by half a period ^tc, i.e., 


^2(^ + - r\{r). 


Taking into account (10), we obtain 


y(r + jTc) = -y(r). 

(21) 

Then from (21), we get 


y( 2 '^c) = -yo and yijXc) = 0. 

(22) 


According to (21), (13) should be solved only in the interval 0 < r < jTc but the value 
yo must satisfy (22); this can be achieved by an iterative procedure. It is the principal 
difference in comparison with the conventional internal combustion engine, where the 
piston amplitude and compression ratio are fixed. 

Combustion condition: Let us consider an Otto cycle consisting of the two adiabatic 
curves (ab and cd) and two isochores (be and da) shown in figure 6a. Heat is received along 
be and is rejected along da. Along the process be, 8q = CydT and q = Cy(Tc — Th); thus 

q =qf/(l-\- noa), (23) 

where is enthalpy of reaction, a is the relative air/fuel ratio, and no is the stoichiometric 
(or chemically correct or theoretical) air/fuel ratio. Thus noot gives the actual air/fuel ratio. 
Since q = Cy{Tc — Tt) we have 

Tc~Tb = qfl[cy{l -f noa)] 


(24) 
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On the other hand, for an isochoric process 


Sc - Sjj = Cy In (Ta/Ti,) ~ Cv In E, 


( 25 ) 


where 


£ = 1 + 


Cyinoa + 1)Tq€^-^ 



(26) 


For a typical hydrocarbon fuel, q/ = 20,000 Btu/lb = 46,500 kJ/kg. For typical cases 
using hydrocarbons, no is about 15 kg air/kg fuel. To = 300 K, and a = 1.25. So £0 = 11, 
and for an Otto cycle £ = 1 + 1 If We can rewrite (18) in terms of £; 


/i- 



(27) 


where E is given by (26), and (13) can be solved accordingly. 


Kinematic condition: The flow in a vortex chamber is created by a special device, a 
swirler, which in principle can move or be at rest. In the case of a porous rotating cylinder, 
the flow in the vortex chamber is characterized by two independent functions Q (t) and F (r). 
Of course, in case of a fixed swirler, Q and F are not independent. This can be illustrated 
by a steady flow in a vortex chamber. Let the swirler contain a number of narrow tangential 
slits. The flow rate into the chamber can be approximated as Qy — nbhV, and according 
to (2) we have 

Q = (nbl2Tc)V, (28) 

where n is the number of slits, each of width b (figure 7). 

On the other hand, according to (1), 

= (29) 

where Vr (R) is the azimuthally averaged radial velocity. From (1) it follows that the design 
parameter. 


Q _ _ nb 

F V(p InR' 


(30) 



Figure 7. Tangential slit. 
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V Figure 8. Semi-Otto cycle. 


is a purely geometric characteristic of the vortex chamber which relates Q to T. 
Equation (30) shows that, once a design is chosen, 6 should be a constant. For unsteady 
flow, (28) is no longer valid, but (29) is. For example, at the instant when Q = 0, Ur is also 
zero, but F is obviously not because of vortex inertia (i.e. even without a flow rate, the 
liquid continues to rotate). For an inviscid liquid, such a flow will continue forever with 
maximum F. For real flow, if the viscous time (~ R^/v) is much larger than the period of 
oscillation, we can assume that (30) is correct for the maximum Q, and thus F = Qmay.10. 

Using (30) and (12) we find the important relationship between system dynamics and 
design parameter 6 

I jimax = (nb/lnR) = 6. (31) 

This equation is the condition for the swirler to be at rest. The derivation of (31) is 
speculative and needs to be verified by experiment with unsteady separation process. 
Equation (29) assumes an axisymmetric configuration of the entrances. So the number of 
entrances (n) should be relatively large. 


Semi-Otto cycle: Experimental realization of the Otto cycle is difficult because we need 
a very high maximum pressure in order to use a realistic compression ratio; our goal is only 
to prove the concept, i.e. to establish the possibility of oscillating stable cavity generation. 
In order to model the cavity with comparable liquid velocities, we are going to use a 
semi-Otto cycle (as shown in figure 8) in our subscale “cold” experimental model (using 
compressed air supply) because it operates at lower pressures than the Otto cycle at the 
same compression ratio. 

In our experiments the parameter E is not meaningful, therefore we introduce a pressure 
ratio parameter x which allows us to consider both cycles: 

X — Pmax/Pmn — Pc! Pa- ( 32 ) 

The values E and x related. For the Otto cycle, 

X = (33) 


Introducing x into (27), we obtain 


0 ^ + yo 
O' + y 


- X 


yo 


CT + y 


/i - /2 = 


k 


(34) 



For the semi-Otto cycle we have 

'g 


/l - /2 = 1 


yo 


G 


( 35 ) 


3.4 Energy equation and operating characteristics 


We found that (13) needs to be solved for half a period 0 < r < rc/2 only, where y(T) < 0 
and (^/2)|yly = —{^/2)y^. Then it can be shown that (13) can be written in the form 


iA 

2 dy 


(y^ + l)ln 


1 


G^ - y2j 


= - mfi - A). 


(36) 


Integrating (36) for the half-period of oscillation between y = — yo and y = yo, we obtain 
the energy equation 

t ryo „ ryo 

^ = n / (/2 - /,)dy. (37) 

Z J-yQ J-yg 

Equation (37) states that inertial features do not participate directly in the energy balance. 
The right-hand side of (37) is the work (during a thermodynamic cycle) and the left-hand 
side is the energy consumption. Equation (37) helps to deduce the power W of the engine. 
The general expression for W is 


W = InR^pV^fn r (/, - /2)dy, 

J-yo 

where / is the frequency, 

/ = 2V/Rtc, 

For an Otto cycle, a simple calculation gives 

0 ^ + }’o 


ryo 

/ ' 


(/i - / 2 )dy 


k-1 


■rjEo 


(38) 


(39) 


(40) 


where ry = 1 — e ^ is the well-known efficiency for Otto cycle (Faires & Virgil 1970), 6 
and Eq are given by (19) and (26). For the semi-Otto cycle, using (27), we have 


r if I - / 2 )dy = X^~il - €'-') - 2yo. 

J ~yo K — 1 

It can be shown that the efficiency for this semi-Otto cycle is 
~l) + - € 


1 - 


x-1 


(41) 


(42) 


The efficiency q of the Otto cycle depends on the compression ratio € alone while that of 
the semi-Otto cycle depends on both e and x- Note that at a fixed x, the efficiency (42) 
has a maximum at 6 = where dq/de = 0. Then 


^max — A :- or T^max = 1 


(43) 


X - 1 ' . - 1 

Figure 9 shows a comparison of this efficiency with that of the Otto cycle with k= 1.4. 

Note that for any given compression ratio e, ryotto > ^max- However, for a fixed pressure 
ratio X in the working range, Ty^ax > Votto- 
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3.5 Free oscillations 


Unlike the conventional solid piston engines where the piston displacements are fixed, 
the VLPE has a variable displacement which depends on system parameters. In order to 
achieve the maximum oscillating amplitude (which gives maximum power and efficiency), 
it is very important to know the eigenfrequency of the VLPE. This may also benefit the 
creation of a “shrinking engine” (Popular mechanics 1995) whose cylinder size can be 
changed corresponding to load to reduce energy losses. 


Small oscillations'. The eigenfrequency for small free oscillations can be obtained from 
(13) by assuming that both y and y are small; tliat is 

y In "4 4- \y = n(/i - / 2 ). (44) 

cr^ 

For an adiabatic process, by letting 5 = 0 in (18), we have for the right-hand side of 
(44), 


n(/i -fi) = n 


When y I and yo 1 > 


Vi 


Vl) 


n 


cr 4- yo 

cr + y 


tr + yo 
or - y 


O' + yo 
or + y 


= 1 


-3^ + 


and 


Fyo 

or -y 


1 H—y + 

O' 


(45) 


(46) 


Using these expansions in (44) and (45), the equation for small free oscillations becomes 


yln-4 -f 


1 2 kn 

_l- 


or 


y = 0, 


and the eigenfrequency is 

1 + IkTla \ 


0)0 


cr^ in (1/or^) 


(47) 


(48) 
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COo 



Figure 10. coq as a function of a with 
fixed n. 


(48) shows that ojq has a minimum value with respect to a . This value is achieved when <j 
satisfies the following equation 

(l+lnor)(l+ytna) = i. (49) 

Figure 10 depicts coq as a function of a with ^=1.4 (for air) and nondimensional pressure 
n =4.0 (for po = 10^ Pa and y = 5m/s, O = From this figure, we can see that 

0)0 has a minimum at cr = ormin- This unique feature will be explained later together with 
figure 14 below. Figure 11 shows that cTmin has its largest value, (Jmin = (1 /^) ^ 0.6065, 

at n = 0 and decreases with increasing TI. Figure 12 shows how the minimum frequency 
changes with n. 


Nonlinear free oscillations'.^ Let us consider (36) with ^ = FI = 0. In this case, (36) can 

be integrated to give 

.2 1 
(y + 1) In y = const, 

( 7 "^ — 

The constant can be found from the condition y(yo) = Oj srrd solving for y gives 


T = 


ln(or2 - yg) 
ln(cr^ — y^) 


(50) 
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Integrating (50) for half-period xd'l we have 


L 


^0 ( - yp) 
ln(cr2 — y'^) 

Defining the eigenfrequency fc as 

fc = l/tc, 

from (39) we have 


- 1/2 


- 1 


dy. 


(51) 


(52) 


/ = (2V/R)fc. ( 53 ) 

Figure 13 shows fc as a function of yo for the cases a = 0.4, 0 . 5 , and 0.6. We see that for 
the typical a range, eigenfrequency increases with increasing amount of fluid in the system 
as well as with oscillating amplitude yo. Both these features are the opposite of those of 
an ordinary oscillating system like a U-tube where increasing amount of liquid decreases 
the eigenfrequency and oscillating amplitude decreases with increase in eigenfrequency. 
These features are useful for a real engine, because with the increase of engine frequency 
and power, the oscillating amplitude increases and so does the efficiency, (19). 

Figure 14 shows fc as a function of a for the case yo = 0.15. The eigenfrequency 
has a minimum with respect to the gas/liquid ratio a for finite amplitude oscillation. This 
somewhat unusual non-monotonic behaviour of the eigenfrequency is related to the strong 
dependence of centrifugal force pv^/r on r and can be explained by making analogy with 



yo 


0.1 


0.2 


0.3 


0 4 Figure 13. f as a function of >’o with 
fixed a. 
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Figure 14. /<• as a function of a with 

fixed yo- 


a U-tube system. For theU-tube, the eigenfrequency is ct) = where / is the height. 

For VLPE, when cr is small, we have larger amount of liquid such that r is small. Then the 
centrifugal force pv^/r tends to infinity as cr tends to zero. This is equivalent to a larger g 
case for the U-tube. Therefore, the eigenfrequency is very high for small a. On the other 
hand, when cr is large, we have small amount of liquid such that the corresponding I is 
small. Between these two extremes there must be a minimum. 

3.6 Stability criterion 

The stability problem is very complicated in the general case, because the corresponding 
linearized hydrodynamic equation has time-dependent coefficients, and Floquet theory 
must be used. Therefore, we prefer to investigate the general stability problem by experi¬ 
ment. However, a simple criterion St, necessary for stability, can be found from elementary 
considerations; 

St = min(gc/gr) = min 1/[2((T + y)y - y^], (54) 

where gc(= r) is the centrifugal acceleration and gr is the radial acceleration. If the 
maximum is achieved at the moment of maximum compression, (54) is slightly simplified 
(y = 0). Clearly, g^ must be greater than gr or else the cavity will collapse; hence, the 
minimal 5^ = 1. We therefore use 5^ > 1 as the stability criterion. The larger the value 
of St, the more stable the rotating flow. Another related condition is gc ^ g, where g is 
gravity. 

3.7 Variable circulation 

The previous formulation was based on the assumption that F = const. But how do we 
choose this constant? It clearly depends on flow rate Q \ in particular, if Q = 0 then F = 0. 
For a steady flow, F = —0 Q, where 0 = tan a = const is a swirl parameter and 

a is the blade angle of the swirler device. This follows from the Kutta condition (i.e. no 
separation) for inviscid flow past blades. If a process is unsteady, the direct application of 
the Kutta condition should give 


r(r)--0<2(O, Q(t)<0. 


( 55 ) 
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But the validity of the Kutta condition for unsteady flow is doubtful because of possible 
flow separation. Nevertheless, in this section, we consider (55) to be valid. For such a 
situation, F is not a constant and depends not only on t, but also on r. We use an equation 
for r 


ar 

~dt 


^ 

+ — 0 ; 

dr 



(56) 


Although r = const, is a solution of (56), it does not satisfy (55). (56) can be solved 
by the method of characteristics. Let us introduce the variables Zi(r) and s{t), such 
that 


Zi(T) 


rf(r) 


i = h2- 


2Q 

di' = —5-dr. 


Note that if Q = const. — RV,sve, have s = r. 
From (10), 

rf + rj = 2R^cf. 


Using the new variables, (56) can be rewritten as 

9/^2 

The general solution of this equation is 

r(r, r) = F(r^ - R^s), 


(57) 


(58) 


(59) 


where F is an arbitrary function which can be found from the boundary condition at r = R. 
The boundary condition (55) then becomes 

F(R^(\ - s)) = -eQ(r). 

If s is known, we can determine the function F using the inverse function r = t(s). 

To derive the equations of motion, we assume that at the initial time r = 0 both 
cylinders are filled with equal amounts of liquid, rotating with F = Fq = const. Once 
the motion starts, a part of the liquid flows into one cylinder from the other according to 
(55), Under such conditions, liquid in the first cylinder still has circulation Fq. Let bj be 
the radius of the boundary between this liquid and the remainder liquid having another 
circulation. 

Thus the circulation distribution is 

T.. . [Fo, 

F (r, r) = { 

I F(r^ — R^s), bi < r < R. 


Mass balance gives 

- bf) = rf) - n(R^ - 





•craax — iiiaA <.\L j, cuiu 
0<f'<r 


I 


Jn r 
Consider the integral 


R r'2- /I 

^ dr = Itz^c^Yq ( - 


1 


Z 1 + Z — Zmax 


dr+ f 
Jn Jh 


R F^ir'^ - Rh) 


dr. 




(61) 


(62) 


/ri r- Jt>^ 

The first part of (62) is given by (61). To calculate the second part, we introduce variables 
z! =\~ r^/R^-, z* = l- h\/R^\ ^=z^7!. 

Using (59), the second part of (62) becomes 


f 

Jbi 


R F^(r^ - R^z) 


-L 


-I 


dr 

Jo 

Zmax F"^ - 1)) 


z. F2(i?2(i 


zO) 


dz^ 


2 2^2(1 +^—f)2 Jz 

^2max(z) Z^(or) 

z{r) 


2RH\ - z')^ 

fZmax 


i: 


2 R^Jz ( H - z -^)2 
dz'. 




(63) 


8 A(t) [1 + z(r) - z']'^ 

In order to integrate equation (63), it is sufficient to know the dependence of 2(z) for all 
previous time. In the same way, we can obtafti the corresponding relation for the second 
cylinder by integrating (2) from r 2 to R and taking into account the boundary conditions 
determined by the swirler at the moment of inflow. Thus, we obtain 

r 2 (^, t) = F2(R'^{1 - s)) = eQ{r{s)), Q > 0(02 = -0). 

The result is slightly different from (8). 

Finally we obtain the generalization of (13): 


zln 


1 / 

z(2a- -z) \ 4 


1 


1 


02 


1 T Zixiin ~ 
jZmax z^(z') 


L 


2a - 

Zmin (^ Z “b Z )2 


Z 


1) + —-b 

Z / 1 + Z Zmax 


dz' 


(64) 


Ti-— 779 dz^ +ensignz -4o -—^^^—— 

(I-Z + ZO 2 4 ^ z(2c7-z) 


= 0 , 


where Zmax(0 na^o<r'<T zit ),Zmin(0 — z(f0>^dO < z{t) < min(l, 2a). 

The integro-differential equation (64) determines the evolution 5 (r) for a given variation 
of the temperature in the cavities. 

The initial conditions are 

z(0) — ZO, z(0) = 0, Zmax(O) = ZQ, and Zmin(O) = zo- 


(65) 
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Numerical calculations of the system (64)-(65) were performed using the Runge-Kutta 
method with adaptive step control. The integrals in (64) were calculated using the technique 
of nonlocal splines at each step of integration. 

Figure 15 shows the dependence of amplitude A5 = ^max—‘^min for the periodic regime 
versus the forcing frequency co. This shows that there is some resonance frequency of. 
The small bumps correspond to o)* /2 and 'hcf jl. The solution near of has a minimal 
transition time to periodic motion. The phase portrait of the transient process for (X)~ of 
is represented in figure 16. The phase trajectories of the transient processes are shown in 
figure 17 for and in figure 18 for a> < &>*. 

Note that these results suggest that we can expect the minimal perturbations of cavity 
surfaces near resonance. The complex transient process with high-frequency oscillations 
corresponds to a flow with O) of (figures 19 and 20). 

If ct) > (D*, the high-frequency perturbations do not appear but amplitudes z and z grow 
strongly and exceed amplitudes of periodic regime (figures 21 and 22). This analysis shows 
that regime with u; cd* is preferable. 
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Figure 16. Phase portrait of the transient 
process for o) = o). 
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Figure 17. Phase portrait of the tran¬ 
sient process for o) > of. 


The calculations with T = const, show that an important difference exists only for small 
frequencies o) If cd > &)*, the resonance curves practically coincide (the resonance 

curve for F = const, is shown in figure 23 by the dashed curve). 

The general conclusion is: if the frequency is sufficiently high (a) > o)*), the assumption 
of r = const, is reasonable, because, in this case, the transient processes can be considered 
quasi-stationary processes. 


3.8 The role of viscosity 

Let the flow rate Q{t) be given. The equation for circulation F is 

^ Qi^aF _ 3 1 aF 

dt r dr dr r dr ' 


( 66 ) 


1.00 -1 
ds/dt 
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Figure 18. Phase portrait of the tran¬ 
sient process for co < o)*. 
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where v is the kinematic viscosity. Assume r\ = ro(l + a sin cot), and choose R as the 
length scale to nondimensionalize r. Introducing other variables 

vt _ coR^ Q{t) 2 

r = =-; q =-; and qo = Qr^a, 

R^ V V 

we have 


Q = R^rir\ = R^rQaco{l + a sin cos Qt, 


o.ao 

s(t) 


q = ^o(l + a sin cos Qt. 
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Figure 21. Transient process for a> 




Figure 23. Comparison of resonance 
curves. 
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Equation (66) is transformed into 

ar d^r i+<?(T)ar 

- = --, ^ ^ 1 

‘■•4 9t dr^ r dr 

with the boundary conditions: 

ar 2r 

r(r) = l; — = — at r-ri. 
or r 

The last one is the free surface condition. 

Let us consider two cases: <§; 1 and Q ::$> 1. For the first case 

r = + (9/2)/-«+^1/[1 + (q!2)rp\ 

If = 0, r = r^ corresponding to solid body rotation. If ^ > 1, F = almost no 

rotation. If ^ -1, F = 1 - 2/\q\{r/rxY^- and V ^ For » 1, F = 1. These 

results justify formulation of the problem using F = const., which is correct for inviscid 

flow, for sufficiently high frequency, which is the most interesting case for the real LPE. 
.-4 

4. Numerical analysis of VLPE 

We first want to estimate the engine parameters using an Otto-cycle. First, we estimate 
the natural oscillation frequency. Using (48), for typical values of the parameters, viz., 
k = lA, a ■= 0.25, and fl = 4, we have <wo = 6.26 and co — 626rad/s. This value of co 
corresponds to n 100 rev/s 6000 rpm, which is typical for the maximum rpm of an 
internal combustion engine for automobiles. Thus, the vortex engine has good dynamic 
characteristics and can be started quickly. 

The value of ^ is roughly estimated as follows. For the mean values we have W = /\pQv, 
or from (2) and (9) 

W = ^{pQl)/(S7Z^R'^h^). (67) 

Let W = 100 kW (= 134 hp), R = OAm, h = 0.1 m, and p = 1000 kg/m^. To estimate 
the value Qy, we can use the formula Qy = jrr^hn, where r is the mean radius which 
can be found by setting r\ = r^. Using (3) we find r — Ryfa and for a = 0.25, we have 
r = 0.5R = 0.05 m. lfn = lOOrps, Qy = 0.078 m^/s. From (67) then 

^ = {%7t'^R^h^W)/{pQl) = 1664. 

For the given dimensions and flow rate we estimate the fluid velocity V = 5 m/s at 
the entrance of the cylinder where r — R. This is a moderate velocity. 

For numerical solution of (13), we use condition (20) and integrate for half a period until 
y = 0. At this moment, if (22) is satisfied, we have yo and all otlier engine characteristics. 
Otherwise, we use secant or Newton’s method to find a new yo and start again until (22) 
is satisfied. 

For the example considered (i.e. W = lOOkW = 134hp, = h ~ 0.1m, ^ = 

5000, V = 5 m/s, and o — 0.4), we obtained numerically c ^ 10, q = 60%, and a 
large margin for stability, St = 72, corresponding to the acceleration Oc ^ 1300 g at the 
maximum compression point. In order to have an acceptable efficiency value, we assign 




Figure 24. Efficiency for Otto cycle with fixed ^. 


the compression ratio, (19). We want to have a maximum magnitude of oscillation, in 
which case the maximum radius r\ = R. This yields yo = 1 — which together with 
(19) gives (7 — 0.55 and yo = 0-45. 

We chose the following parameters for the experimental device: R = 0.1m, ^ = 
0.019 m, h = 0.3 m and = 5 m/s. From numerical calculations, |y|max = 0-03. The 
corresponding working parameters are: ^ = 4.49, ^ = 20 000, / = 1.05 Hz, ac = 806 g, 
St = 1986, W = 669 kW, Ap = 1.12 atm, Gmax = 0.028 m^/s, Pr\ - PR2 = 2.25 atm, 
PRl = 2.25 atm, and pRi — pri = 1.13 atm. The low frequency here is only for detailed 
investigation of the process and is not typical for the vortex engine. All these parameters are 
used and verified in our experiments. A distinct feature of the vortex liquid piston engine 


Power 

Otto cycle with ^=1000 



Figure 25. Power for Otto cycle with fixed ^. 
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Frequency 

Otto cycle with C= 1000 



Figure 26. Frequency for Otto cycle with fixed ^. 

is that its efficiency increases with increasing power. This can be explained by the increase 
of the oscillation amplitude of the radii of the cavities and therefore of the compression 
ratio 6. The vortex engine differs significantly from the conventional internal combustion 
engine where e is fixed and rj decreases with increasing power. 

To complete the numerical analysis, we calculated the power, efficiency, and frequency 
of this engine for Otto cycle. Figures 24-26 show the results for a constant f = 1000. 
From these figures, we can see for the Otto cycle: 

(1) both engine power and efficiency increase with x » but change little witli cr. 

(2) engine frequency decreases with cr (figure 26). This is a unique feature because it 
shows that the frequency increases with the mass of the oscillating system. 


Power 

Otto cyds wth 0 = 0-5 



Figure 27. Power for Otto cycle with fixed a. 
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X 

Figure 28. Efficiency for Otto cycle with fixed a. 


Figures 27-29 show the results with a constant cr = 0.5. From these figures, we can see 
for the Otto cycle that power, efficiency, and frequency, all decrease with increasing ^. 

Figures 30 and 31 show the case when x is small. From these we can see for the 
semi-Otto cycle: 

(1) engine power increases with increasing a (figure 30b) and x (figure 30a); 

(2) engine power has a maximum with respect to CCCmaxw) (figures 30a and b); with 
increase of cr (less fluid), ^max w decreases (figure 30b); however, if x is large enough, 
Cmax 10 is almost constant with increasing x ^ 


Frequency 

Otto cycle with a = 0.5 



Figure 29. Frequency for Otto cycle with fixed <x. 
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(3) engine efficiency increases with increasing a (figure 31b) and x (figure 31a); 

(4) engine efficiency has a maximum with respect to C(Cmax^) (figure 31a); with increase 
of G (less fluid), increases (figure 31b); however, when i; is large enough, engine 
efficiency will be almost constant (figure 31a). 

Figure 32 shows the similar case for the Otto cycle. 

5. Engine design and experimental analysis 

After finding all the necessary relationships and the optimized design parameters, we built 
a working model using water and air. Figure 33 shows our first design. It consists of two 
cylinders connected by two tangential channels and a check valve, so that water can flow 
in each channel only in one direction. 

We used an external oscillating circuit to control all four solenoid valves (one air inlet 
and one exhaust valve for each of the chambers). The control circuit is shown in figure 34, 
where R2 can be adjusted to achieve different oscillation frequencies. 

After the first prototype was built and tested, we discovered a serious problem with 
this design. We had assumed that high-speed water would enter the chamber and rotate 
inside it to create a stable axisymmetric cavity. However, a very complicated turbulent 
flow without swirl and cavity was produced (figure 35) because of the tangential outlet 
whose lip generates a big disturbance due to flow impingement and thus an obstacle to the 
rotating flow. A swirling wall jet of water hits the outlet conduit and is deflected from this 
obstacle back to the centre of the chamber. 

In order to eliminate this “outlet effect” and provide a smooth surface for the rotating 
flow, we completely blocked the upper channel (figure 36a) and divided the lower one into 
two halves (figure 36b): one half is for incoming flow and the other half for outflow. This 
time swirling flow and a cavity were created. However, on injection of pressurized air into 
the cavity, the cavity immediately collapsed. After careful analysis, we found the main 
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•tl2V 



lo 2i'(l Iniel 

and 151 e«housi Figure 34. Control circuit. 

reason is because the outlet is not axisymmetric. This breaking of the system symmetry 
causes the cavity to be destroyed. 

Since axisymmetry is crucial, it was thus necessary to modify our model. Figure 37 
shows a meridional cut of the actual modification where the block in the middle prevents 
air from exiting with the liquid when this model works horizontally. 

Figure 38 shows a picture of the experimental configuration. As expected, the cavity 
created with this modified model expands smoothly when pressure is applied. Therefore, 
we have achieved a working chamber with an expandable and contractible cavity inside 
it. The cavity created is quite stable as predicted by our theory. All the system param¬ 
eters measured, such as oscillation frequency, flow rate, and power, are comparable to 
our theoretical model and numerical results. Figure 39 shows the minimum cavity in the 
periodic working engine. One can see from these pictures that the cavity, though suc¬ 
cessfully generated, is not quite axisymmetric. The reason is the non-axisymmetry of the 
inlets and outlets of the system and machining roughness. This is why further redesign 
was necessary. 

We realized that the most important design goal is to make the system as axisymmetric 
as possible, because any non-axisymmetry will most likely cause the cavity to be unstable 
or to collapse under sudden application of pressure. In order to have an axisymmetric inlet, 
we decided to use a distributor as shown below (figure 40). This distribution forms evenly 
distributed water jets and helps us create a more stable cavity. 



Flow in 


Figure 35. Cavity did not fonn in the first 
model. 
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Figure 39. Minimum cavity of the peri¬ 
odic working engine. 


For simplicity of pipe connections, we used a horizontal configuration for thq first 
experiment, which caused an unstable cavity and air-water mixing. Therefore, the next 
design used a vertical configuration. 


6. Engine redesign and experimental analysis 

In order to avoid air/water mixing which occurred in the first model, we decided to adopt 
a vertical configuration with swirlers (figure 41). From the previous model, we realized 
that the most important design goal is to make the system as axisymmetric as possible, 
because any non-axisymmetry will most likely cause the cavity to become unstable or to 
collapse on the sudden application of pressure. 

In order to have an axisymmetric inlet, we designed swirlers as shown in figure 42. This 
swirler forms evenly distributed water jets which helps to create a more stable cavity. 

The number of blades on the swirler is not arbitrary and is related to the dynamics of 
the system. Its relationship with hydraulic load ^ and frequency f is shown in figures 43 
and 44 respectively. 

If we want this model to work at a modest frequency, say 5 Hz, just to demonstrate the 
principles, from figure 44 we find the corresponding n is 12. 

The geometry of the swirler is also related to the dynamics of the system and has been 
carefully designed. Figure 45 shows a typical entrance geometry where angle BAD forms a 



Figure 40. Water distributor. 




Figure 41. Vertical configuration. 




Figure 43. Load coefficient ^ as a func¬ 
tion of n. 



Figure 44. Oscillating frequency as a 
function of n. 






The vortex liquid piston engine and some other vortex technologies 


357 


Figure 45. Swirler geometry. 

single entrance. It has been proven that when the two blades intersect at point A, circulation 
is maximum for a given flow rate. As shown before, we introduced a new parameter 
0 = VrlV(p — lyl-From figure 45, it can be found that 0 = cotA = (1—sin^ A)^/^/sinA. 
When —0 (tangential inlet), we have 

^ f■ '1^ . . . “ . “ 2 

0=4 sm —h 1 — 4 sm — sin-cos — 

V 2 2 4 4/ 

where a — iTcjn. 

We also improved the control system. First, we removed the external oscillator and 
used the signals from engine check valves as control parameters. Then we replaced the 
mechanical relay control system with a new microcontroller and solid-state relays. In 
order to achieve the semi-Otto cycle, air supply valves Ai and A 2 (figure 46) must be 
open for only a very short time 7’ (to mimic combustion), and this time should be ad¬ 
justable. Aj, Ej, and 1/ are the air supply, exhaust and inlet check valve for chamber i 
respectively. 

Figure 47 shows the control sequence for the valves having a very short adjustable time 
t for the air inlet valves to simulate combustion. 

We use a Basicon MC-IZ microcontroller to implement this control sequence. It is a 
fully self-contained general purpose programmable controller with CPU, RAM, ROM, 
real-time-clock, I/O, communications circuitry and a resident BASIC language. 

Figure 48 shows the setup of the new model and figure 49 is a close-up to reveal the 
cavity. 





12 


Figure 46. Schematic of the engine 
control. 



12 open 12 close 12 open 12 close 12 open 12 close 

Figure 47. Timing diagram. 


7. Summary 

Here we briefly review the basic principles and potential advantages of the VLPE. The 
two cylinders are interconnected by two tangential, unidirectional flow channels containing 
hydroraotors. Because of the tangential inlets, the liquid is caused to rotate in each cylinder 
and creates a vortical liquid body with a cylindrical cavity in the centre. Each cylinder 
has an inlet valve for the intake of fuel and air, as well as an associated exhaust valve. 
The cylinders can either have an electric spark plug or work in a diesel mode with fuel 
injectors. 

The rotating liquid in the engine simultaneously performs three functions: (a) it creates 
a stable cavity; (b) it stores rotating kinetic energy (a liquid “flywheel”); and (c) it has 
effective elastic properties for damping detonation and permits a higher compression ratio 
than that of the conventional engines. 

Our mathematical model describes the behaviour of this two-cylinder engine. Here we 
present the design and operating parameters of a 100 kW engine. The diameter of each 
cylinder is 0.1 m, the cross-sectional area of the tangential inlets and outlets is 20 cm^ 
and the velocity of the input liquid is 10 m/s. The corresponding pressure drop across a 
hydromotor is 14 atm. The thermal efficiency for the Otto cycle is 0.6 and the compression 
ratio is about 10. The stability criterion for this example is 72, which ensures a large 
margin of stability. The acceleration in the maximum compression stage is very large 
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approximately 1300 g. When running in diesel mode, the power and efficiency of an 
igine of the same size will increase significantly because of higher compression ratio, 
le preliminary experiments have demonstrated the stable cavity in the “cold” model of 
)rtex liquid piston engine. 

The engine employing a rotating liquid as a piston has the following advantages: 

) It is small, lightweight and generates large amounts of power; 

) It is simple to service and repair, having low maintenance; 

) The damping of detonation allows high compression ratios, and thus significantly 
improving thermal efficiency; emission of environmentally damaging pollutants is 
dramatically reduced; 

) If a receiver is available to act as an accumulator, a new type of hydrodynamic braking 
is possible; part of the kinetic energy might be transferred to the receiver by the 
hydromotor acting as a hydropump to provide energy to the receiver. 


Other vortex machines - concepts and applications 

''e are using the features of vortex flows outlined in the introduction to develop several 
)rtex machines, briefly mentioned below. 


jrtex thruster (VT): It is essentially a bladeless helicopter, a real ‘flying saucer’ of 
•actical relevance. The basic working principle of this device is to create a strong swirling 
Dw which produces a low pressure above a surface, thus generating lift or thrust. The 
virl in the incoming flow is achieved via an open vortex chamber, and the resulting low 
:essure rarefaction is intensified by an airfoil-shaped diffuser, which ensures an attached 
Dw without separation. Referring to figure 50, the propulsion device has an open vortex 
lamber (5) which serves to create a strong swirling open fluid jet; (1) is a source of 
:essurized fluid, and (2) is a header for uniform flow distribution. The flow enters the 
virler (3), acquires strong angular momentum and exits over the dilfuser (4), where it 
verses direction (by the Coanda effect) and is ejected into the ambient fluid. Such a 
Dw is like an artificial tornado, creatine a strone rarefaction (low-nressure) zone on the 
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Figure 53. Schematic of VNR. 


scroll. This allows uniform distribution of the fluid along the guide device (2), The fluid 
then enters the vortex chamber with appropriately profiled walls (4) and leaves through 
exits (6). The fluid plays a two-fold role: First, it removes heat as it flows through the 
particle layer; second, rotation of the fluid ensures confinement of the layer. 

The high centrifugal acceleration makes the VCR a more efficient fluidized bed than 
conventional technology which utilizes gravity. VCR is particularly well-suited in chem¬ 
ical reactors using catalysts, and in burning pure, granular aluminum as fuel for military 
and aerospace applications. The swirling flow in the vortex chamber with the exhaust at 
the centre guarantees residence of the particles (catalysts, reacting fuel, etc.) within the 
chamber while they are essentially consumed, thus significantly increasing efficiency and 
reducing wastage. The fluidized layer would also be useful in the agriculture industry for 
efficient surface drying and separation of grain. 

A specific potential application of this technology is a vortex nuclear reactor (VNR), 
which employs a concentrated pseudofluidized layer (CPL). The CPL consists of small 
spherical fuel pellets rotating in a special vortex chamber surrounded by a fluid (liquid or 
gas) acting as a heat transfer medium. To illustrate its basic operation, the active section of 
the VNR for space apphcations is shown schematically in figure 53. It consists of granular 
fissionable material formed into a CPL in a special vortex chamber. 

The VNR consists of several parts: fuel pellet and hydrogen reservoirs, fuel pellet CPL, 
injection guide slots, beryllium moderator, and exhaust nozzle. It has the following possible 
uses: 

(i) nuclear reactor with high specific power output utilizing radically reduced amounts 



(iii) high neutron flux research reactor and breeder. 

The VNR has significant safety advantages over conventional nuclear reactors because 
of minimal amount of fuel pellets in residence in the chamber at any time, very high 
specific power, compact size for ease of maneuverability, and a self-cleaning capability 
for encrusted fuel elements. This latter feature permits use of the VNR with heat transfer 
media such as oil or sea water. 


Bubbling centrifuge (BC): Vortex bubbling devices aie used to enhance heat and mass 
transfer processes between liquids and gases by injecting the gas into the rotating, cen- 
trifugally stabilized liquid. Direct control of the strength of the centrifugal force permits 
the construction of very compact machines with high efficiency compared with conven¬ 
tional devices which rely on gravitational force for their operation. Since this force cannot 
be varied, such an apparatus must frequently be veiy large in order to process the nec¬ 
essary volume of gas. Thus, there are major expenses associated with the construction, 
transportation, and operation of these devices. High maintenance costs are also associated 
with the frequent removal of interior scale and deposits. With its drastically reduced size 
and simplicity of operation, vortex barbotage machinery offers significant reductions in 
costs, at all levels. In the condensed stable state (of the bistable states) of a gas-liquid 
mixture in a vortex chamber (figure 54a), a self-organized, rotating homogeneous layer of 
foam is formed neai' the periphery of the chamber (Goldshtik 1981). In this layer, the high 
centrifugal acceleration suppresses the entrainment of drops and increases the liquid-gas 
interface; this provides a very large reaction surface area per unit volume and enhances 
interfacial transport, further accentuated by turbulent mixing. 

Principle of operation - Gas is injected from the walls into a swirling flow of liquid, 
forming a rotating bubbly or frothy layer. Extensive theoretical and experimental studies 
have determined the precise vortex chamber shape and other design parameters necessary 
to ensure a stable froth layer. 




Figure 54. Bubbling device and layer. 
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Under these conditions, the stable froth layer maintains a liquid-gas force balance at 
the chamber walls as well as in the interior, ensuring uniformity in gas/liquid interaction 
as the gas passes through the liquid. A photograph of such a layer in a recent laboratoiy 
experiment is shown in figure 54b. The tangential injection of gas at the chamber walls 
through small nozzles creates bubble streams which then transfer their angular momentum 
to the liquid. Under normal operation, this generates centrifugal forces in the chamber 
which are 10 to 100 times greater than gravity. Such increased forces permit the use of 
much smaller diameter bubbles, which greatly increases the gas/liquid interfacial con¬ 
tact area. In addition, the very high speeds of the gas bubbles through the liquid and the 
strong secondary cross-flows in the liquid significantly enhance the liquid/gas interfacial 
interaction. Experiments have shown that a homogeneous froth layer can be maintained 
with correct design, without disaiption by bubble coalescence or other nonuniformities. 
Due to these factors, a vortex barbotage apparatus can be built that can process the same 
volume of gas as a conventional device which is ten times its physical size and which 
consumes more energy. Here is a summary of the advantages of the vortex bubbling 
machine. 

(i) Compared to conventional bubbling equipment, it is up to 10 times smaller and weighs 
as little as 1000 times less than comparable gas processing devices. 

(ii) It has very high efficiency due to enhanced thermal and mass transfer processes. 

(iii) Its small size and weight make transportation, and possible installation in existing 
facilities easy. 

It can function with a wide range of gas and liquid flow ratios; thus the device can 
operate under strong inertial loading (e.g. on a moving vehicle), or in a micro-gravity en¬ 
vironment (space apphcations); and works with the froth layer detached from the chamber 
walls; this reduces formation of scale and encrustation on walls, reducing maintenance 
costs. 


Possible applications include: 

(i) both particulate and vapour pollutant emission control at fossil fuel power generating 
plants. The worldwide market for such devices is in billions of dollars; 

(ii) absorption and desorption of gases: e.g. carbon dioxide, nitrous oxide, oxy¬ 
genation and carbonization of solutions, and many other chemical technological 
processes; 

(iii) vaporization and condensation of liquids, such as in the thickening, heating and cool¬ 
ing of solutions, e.g. milk; 

(iv) petroleum refining having a wide variety of applications; 

(v) three-phase processes, in which the solid particle/liquid mixture is kept in a gas 
stream; this has important applications in the mining industry, e.g. flotation or segre¬ 
gation of fine gold from mine waste, and 




(vi) vortex nuclear reactor with a liquid uranium active zone. Because the critical temper¬ 
ature of liquid uranium is ~ 20,000 K, it is possible to heat the hydrogen working 
fluid up to temperatures of 6,000-7,000 K, giving a specific impulse of ~ 1, 500 s. 


Vortex mill (VM): Theory and experiments (Goldshtik 1981, 1985) show that large par¬ 
ticles can be ground efficiently while in the rarefied stable state (of the bistable states) of 
a solid-gas mixture in a rotating flow (figure 4). Adding acoustic resonators to the vor¬ 
tex chamber (figure 55) produces an intense sound field which considerably improves the 
grinding performance. We conjecture that this sound aids in grinding by preventing the 
V healing of collision cracks between successive impacts, thereby facilitating crack propaga¬ 
tion. Operating parameters can be modified appropriately to allow extremely fine grinding, 
and thus VM has a very crucial use, e.g. in obtaining pure (99.9999%) yttrium powder 
needed for making sintered film coatings for cathodes in the,electronics industry. This 
is also an important ingredient in the manufacture of high-temperature superconductors. 
VM can also be used in cement production, and in the mixing of cement and fly ash in 
the manufacture of concrete slabs. In power plants, VM can permit on-site grinding of 
coal immediately prior to combustion, increasing combustion efficiency and eliminating 
olf-site pulverizing to reduce transportation costs; in fact, it can be an integrated part of 
the combustor. VM can grind plastic as well as brittle material and thus can help grind 
used tyres for recycling, and polymers for use as additives for drag reduction in pipelines, 
submarines and boats etc. It can also be used as a very efficient liquid atomizer. 


The vortex liquid piston engine project is sponsored by Advanced Research Project Agency 
(ARPA) under grant No. MDA972-93-1-0020. We are grateful to Dr Ed Carapezza for 
numerous technical discussions and Dr George Broze for reviewing the manuscript. 
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9. List of symbols 


Cp, Cy 


HQ 
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V 
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e 
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X 

CO 


specific heats; 

temperature ratio of combustion; 

nondimensional pressure; frequency; 

chamber height; 

adiabatic exponent, Cp/Cy\ 

number of slits on a swirler; 

stoichiometric air/fuel ratio; 

pressure; po - lowest pressure in a cycle; 

enthalpy of reaction; 

area flow rate; Qy- volume flow rate; 

oscillating cavity radius; 

universal gas constant; cylinder radius; 

entropy; 

stability coefficient; 
temperature and torque; 

volume and inlet velocity; Vq ~ volume at pressure po\ 
radial and tangential velocities of interface; 
engine power; 

nondimensional volume; yo - initial value of y; 

swirler distribution angle, relative air/fuel ratio; 

circulation; 

compression ratio; 

thermal efficiency; 

load coefficient; 

kinematic viscosity; 

pressure parameter; 

velocity ratio; 

density; 

ratio of the total gas volume to total cylinder volume; 
nondimensional time scale; 
period of oscillation; 

ratio of maximum to minimum pressures during one cycle; 
eigenfrequency; coq - eigenfrequency for small oscillation. 
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Jet flames from nondrcular burners 
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Abstract. Gas jets from noncircular exits entrain more air from surroundings 
than jets from circular exits of equivalent area. Because the mixing rate of fuel 
and air governs the combustion and pollutant emission of diffusion flames and 
partially premixed flames, noncircular geometries offer a passive control of the 
flame characteristics. In this paper, the literature on nonbuming, noncircular 
jets is reviewed and recent studies on noncircular jet flames are discussed with 
focus on the work conducted in the author’s laboratory. 

Keywords. Gas jets; noncircular burners; jet flames; flame characteristics. 


1. Introduction 

Combustion and pollutant characteristics of fuel gas jets are controlled by the mixing rates 
of fuel and ambient air. Hence, manipulation of fluid mechanics in the vicinity of the burner 
mouth by varying the exit geometry appears to be an attractive method of controlling the 
thermochemical processes to increase combustion efficiency and reduce pollutant emission 
from diffusion flames. This paper presents the research on tliis topic with focus on the work 
at the University of Oklahoma. 


2. Background 

Noncircular exits are often considered to be passive and inexpensive methods of control¬ 
ling the characteristics of isothermal jets. The geometries usually considered are elliptic, 
triangular, and rectangular with a small aspect ratio of 2 to 4. The nonaxisymmetric flow 
field tends to remove its instability by gradually becoming symmetric downstream. In the 
process, the smaller dimension tends to become larger and the larger dimension tends to 
become smaller, leading to a phenomenon called axis-switching. Before achieving sym¬ 
metry, a few switchings usually occur. This characteristic of noncircular jets has been well 
documented (Sforza et al 1966; Schadow et al 1984; Gutmark et al 1985; Ho & Gutmark 
1987; Hussain & Hussain 1989; Quinn 1989; Gollahalli et al 1992). The locations and 
the number of switch-overs are strongly dependent on the initial conditions, the aspect 
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ratio of the nozzle exit, and when excited on the flow Strouhal number. The dominance of 
coherent structures and azimuthal instabilities are stated as responsible for axis-switching. 
Koshigoe et al (1988) have theoretically analysed the condition for deformation of noncir¬ 
cular jets. The analysis of large eccentricity elliptic jets performed by Crighton (1973) has 
shown them to be spatially as well as temporally stable. Ho (1986) states that entrainment 
in two-dimensional shear layers and axisymmetric jets occurs during the vortex-merging 
event. In nonaxisymmetric jets, however, vortex-merging and azimuthal deformation occur 
simultaneously. 

In comparison to the number of studies on nonreacting jets, work on the combustion of 
noncircular jets is severely limited. Schadow et al (1984) have found in a large-scale study 
that the combustion efficiency in elliptic jets is about 10% higher than in circular jets. 
Better mixing is shown to be the cause of this improvement. Gollahalli and his associates 
(Schadow et al 1984; Gollahalli & Prabhu 1990; Prabhu & Gollahalli 1990; Kamal & 
Gollahalli 1993; Subba & Gollahalli 1995) have investigated the structure and pollutant 
formation in flames of noncircular jets at various conditions. Cannon & Queiroz (1994), 
have studied the time-resolved temperature measurements in an elliptic turbulent diffusion 
flame. Recently, Papanikolau & Wierzba (1995, 1996) have investigated the lift-off and 
blow-out characteristics of diffusion flames. These studies are discussed in the following. 


3. Turbulent diffusion flames 

The results in this section are extracted primarily from the works of Prabhu (1989), Khanna 
(1990), and Kamal (1995). The experiments by Prabhu and Khanna were carried out 
on the same burner and experimental apparatus. Prabhu investigated the thermochemical 
characteristics and Khanna studied fluid mechanics of the turbulent gas diffusion flames 
from elliptic nozzles. Kamal recently focused his work on the coupling of burner size and 
exit Reynolds number on the noncircular burner effects. 

3.1 Experimental apparatus and instrumentation 

The experiments in these studies were performed in a vertical flow combustion apparatus. 
This facility had a combustion chamber of square cross-section (76 cm x 76 cm) and 117 cm 
height. The chamber had air-cooled Pyrex glass windows (20 cm x 20 cm x 92 cm) on 
aU the side walls. While drawing gas samples and probing the temperature field, one 
of the glass windows was replaced with a slotted metal sheet through which the probes 
were introduced. The glass windows extended down to the floor level of the chamber 
to permit optical probing of the flames in the vicinity of the burner mouth. The burner 
assembly mounted to the floor of the chamber consisted of a contoured nozzle section 
located concentrically inside a cylindrical chamber. The fuel nozzle had four parts: a 
divergent inlet section, a cylindrical settling section, a convergent contoured section, and 
interchangeable nozzle tips. Nozzle tips of different configuration could be attached to the 
contraction section. Except in Kamal’s work on burner size effects, a circular nozzle tip of 
exit diameter 9.5 mm i.d. was used. The elliptic nozzle tips (major diameter = 16.4 mm 
and minor diameter =5.5 mm) had the same exit area as that of the circular nozzle. The 
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Figure 1. Details of the elliptic nozzle 
tips for diffusion flame studies. 


ratio of the length of the elliptic cylindrical portion to the major diameter of these nozzle 
tips was at least 7, ensuring the ellipticity of the initial geometry of the fuel jet. Figure 1 
shows the details of the elliptic nozzle tips. A mixture of propane and nitrogen was used as 
the jet fluid for the reason discussed later in relation to flame stability results. Combustion 
air was supplied from an oil-less rotary-vane compressor to the annulus surrounding the 
nozzle. An assembly of a honeycomb section and two sets of fine-mesh screens ensured 
uniform and low-turbulence air flow concentric with the fuel jet. In all experiments reported 
in this study the velocity of co-flow air was maintained at 0.66 m/s. The fuel jets were 
ignited with a pilot Bunsen flame which was withdrawn during tests. 

Direct colour photography with an exposure time of 1 second was used to determine the 
shape and dimensions of the visible flame. A two-mirror (200 mm diameter) Z-aiTangement 
schlieren system with a xenon stroboscopic light source of flash duration 1.5 p.s was used 
to visualize the flow structure in the near-nozzle region of the flames (KJianna 1990). The 
temperature field was probed with a thermocouple. The composition field was determined 
by gas sampling and analytical instruments consisting of nondispersive infrared analysers, 
a polarographic analyser, a chemiluminescent analyser, and a gas chromatograph. Soot 
concentration was determined by filtering known volumes of diluted gas samples. Radiation 
emitted from flames was measured with a wide view angle radiometer. The flow field in the 
vicinity of the flame base was probed with a laser Doppler velocimeter. Both nozzle and 
air-flow streams were seeded with magnesium oxide particles. A microcomputer-based 
data acquisition system was used to acquire and process the output data of the LDV. The 
details of instrumentation and error analysis are given by Prabhu (1989), Khanna (1990), 
and Kamal (1995). Table 1 shows the nominal experimental conditions. 

3.2 Flame stability 


Most of the studies on flame life-off have focused on the burner exit velocity as the 
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Table 1. Nominal experimental conditions. 


Type of flame 

Turbulent diffusion flame 

Partially-premixed flame 

Fuel 

Propane and nitrogen mixture 

Natural gas 

HHV (MJ/kg) 

11-18 

37.3 

Energy input rate (kW) 

750-1232 

5.86 

Fuel jet velocity (m/s) 

— 

49.1 

Fuel jet Rey 

— 

6500 

Burner exit velocity (m/s) 

5.60 

1.275 

Burner port Rep 

7810 

3640 

Ambient temperature (K) 

297 

295 

Ambient pressure (mm Hg) 

745 

749 

Ambient relative humidity 

65% 

65(%) 


delineated. Prasad et al (1991) employed a method in which the lift-off phenomenon can 
be studied in terms of the reactant concentration in the jet fluid while keeping the burner 
exit velocity constant. This method is useful to isolate the effects of parameters like nozzle 
shape by decoupling their influences from the accompanying changes through the jet exit 
velocity. Hence, a mixture of N 2 and CsHg was used as the jet fluid and the concentration 
of C 3 H 8 at transitions was used as an indicator of the flame stability. 

Figure 2 compares the transition characteristics of flames from circular and elliptic 
(aspect ratio 3:1) nozzles when the mole fraction of propane, Xp, in the jet stream is 
decreased while keeping the exit velocity approximately constant at 5.65 ± 0.30 m/s. The 
nominal exit velocity corresponds to a Reynolds number of4740 in the case of the circular 
nozzle, and 7810 and 2620, based on the major and minor diameters in the case of the 
elliptic nozzle. Here L is the flame standoff distance and Deq is the diameter of the circular 



Figure 2. Lift-off and reattachment char¬ 
acteristics of turbulent diffusion flames 
from circular and elliptic nozzles. 
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Dzzie of equal exit area. When Xp in the jet stream of the circular nozzle was above 
.34 the flame was attached to the burner. At Xp equal to 0.32, the flame base lifted off the 
amer and stabilized about 3 nozzle diameters above. When Xp was between 0.32 and 0.34 
le flame base position was not well-defined and the flame was seen to be detached from 
nly a part of the nozzle rim. The flame edges appear smooth and the flow near the flame 
ase appears laminar. This transition is denoted as “Laminai' Liftoff” and is shown as LL 
[1 the figure. When propane mole fraction is further decreased to 0.28, the liftoff height 
iddenly increases to about 10 diameters while the flame base widens to about 4 nozzle 
iameters. Flow near the flame base is turbulent in this configuration. This transition is 
enoted as “Turbulent Liftoff” and is indicated as TL on the figures. When Xp is decreased 
irther L increases slowly before flame blows out at Xp = 0.24. With the elliptic nozzle, 
le first transition LL occurs at Xp = 0.38, the second transition TL occurs at = 0.33, 
nd flame blows out at Xp = 0.24. This shows that elliptic nozzle flames require more 
^active component in the jet fluid to avoid transitions of burner-attached flame to liftoff 
nd blowout conditions than circular nozzle flames. Also, the flame-standoff distance does 
ot remain constant between the first and second transitions in the elliptic nozzle flames, 
1 contrast to the circular nozzle flames. 

When propane mole fraction in the jet fluid is increased while the flame from the 
ircular nozzle is in the lifted condition, the L/Deq versus Xp curve does not retrace its 
ath and exhibits hysteresis similar to the hysteresis noticed in earlier studies (Scholfield 
: Garside 1949; Gollahalli et al 1987). In those studies, the differences in the turbulence 
haracteri sties of the flame-standoff region upstream of the flame base between the attached 
nd lifted flames were thought to be responsible for the hysteresis. The hysteresis behaviour, 
owever, is not significant in the flame from the elliptic nozzle. The earlier initiation of 
istabilities and consequently higher turbulence level in the elliptic nozzle jets seem to be 
jsponsible for the negligible hysteresis in the liftoff-reattachment characteristics of the 
lliptic nozzle flame. 

.3 Fuel concentration profiles 

'he axial concentration profiles of propane in the attached flames from circular and elliptic 
ozzles are shown in figure 3. We notice that fuel concentration decreases more slowly 
long the axis and the fuel persists longer in the circular nozzle flame than in the elliptic 
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Figure 3. Effect of nozzle shape on the 
axial concentration profiles of fuel in tur¬ 
bulent diffusion flames from circular and 
elliptic nozzles. 



nozzle flame. For instance, the value of Xp decreases to 0.005 at x/D = 48 in the circular 
nozzle flame, whereas the same value is reached at x/Z)eq = 34 in the elliptic nozzle 
flame. The rapid depletion of propane is caused by the greater dilution due to the higher 
entrainment in the elliptic nozzle flame. 

3.4 Temperature profiles 

Figure 4 shows the radial temperature profiles at three axial levels in the circular and 
elliptical nozzle flames attached to the burner. These locations are in the near-nozzle, 
midflame, and far-nozzle regions of the flames. In both flames the near-nozzle profile has 
a sharp off-axis peak. A comparison of these profiles reveals that the circular nozzle flame 
has a slightly higher peak temperature compared to the elliptic nozzle flame in the near¬ 
nozzle region. However, in the midflame and far-nozzle regions, the elliptic nozzle flame 
has higher peak values. As homogeneous gas phase reactions are dominant in the near- 
nozzle region, the more rapid development of a shear layer and the higher degree of mixing 
with air in the elliptic nozzle flame could lead to a lower peak temperature. On the other 
hand, in the midflame and far-nozzle regions, the kinetics-controlled heterogeneous soot 
oxidation reactions are the primary mechanisms of heat release (Gollahalli 1977). These 
kinetics-controlled processes are enhanced in the elliptic nozzle flame by the larger oxygen 
availability, thereby leading to higher rates of burning of soot. The higher heat release 
rates and the lower heat loss rates due to lower concentrations of soot result in higher 
temperatures in the midflame and far-nozzle regions of the elliptic nozzle flame, compared 
to the temperatures in the corresponding regions of the circular nozzle flame. It appears 
that the dilution effect of increased entrainment of air into the flame is overshadowed by 
the enhancement of soot combustion which results in higher temperature in the midflame 
and far-nozzle regions of the elliptical nozzle flames. The differences in the jet growth rate 
determined from velocity measurements in circular and elliptic nozzle flames presented 
later are in conformity with the temperature profiles. 

3.5 Nitric oxide and particulate concentration profiles 

Figure 5 shows the radial concentration profiles of nitric oxide at three axial levels in the 
attached flames from the circular and elliptic nozzles at the same conditions at which the 
temperature field was probed. Except in the far-nozzle regions, the differences in the peak 
concentration of nitric oxide in the circular and elliptic nozzle flames follow the trends 
of temperature profiles discussed above, as expected of thermal nitric oxide produced 
through Zeldovich reactions. In the near-nozzle region, the peak concentration of nitric 
oxide is higher in the circular nozzle flame, because of the thinner interfacial reaction 
zone that results in a higher peak temperature than in the near-nozzle region of the elliptic 
nozzle flame. In the midflarae region, the elliptic nozzle flame has a higher nitric oxide 
concentration than the circular nozzle flame, which also follows the trend of temperature. 
In the far-nozzle region, the average of the NO concentrations on the major and minor axes 
of the elliptic nozzle is slightly less than the NO concentration level in the circular nozzle 
flame. 
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Figure 6. Effects of nozzle shape on soot 
concentration and opacity in turbulent dif¬ 
fusion flames from circular and elliptic 
nozzles. 


The radial variations of particulate concentration and opacity of the combustion product 
samples drawn from the far-nozzle regions of the circular and elliptic nozzle flames are 
shown in figure 6. Clearly, soot concentration in the elliptic flame is smaller than in the 
circular flame. The difference in the opactities of samples from the two flames is not as 
large as the difference in the mass concentration of soot, probably because of the variance 
in the size distribution of the particulates. The lower soot concentrations in the elliptic 
nozzle flame are in conformity with the effects expected from the higher entrainment of 
air. The higher entrainment of air (Gutmark et al 1985) promotes soot oxidation rates in the 
far-nozzle region. Consequently, the particulate concentration in the elliptic nozzle flame 
is lower. 


3.6 Nitrogen oxides emission index 

Figure 7 shows the comparison of the emission index (El) of NO^ of the circular and elliptic 
nozzle flames. The values of emission index (ng/J) of elliptic nozzles are normalized with 
the emission index of circular nozzle. The letter E corresponds to the elliptic nozzle and 
the number following the letter E refers to its aspect ratio. It is clear from the figure that in 
turbulent diffusion flames the EINO.v is not significantly different for elliptic and circular 
nozzles. This can be traced to the opposing variations of the peak NOx concentrations 
in different regions of the flame. The smaller value of EINO^ reflects the trends of the 
NO concentrations in the far-nozzle region. The degree of reduction, however, varies 
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Figure 7. Comparison of nitric ox¬ 
ide emission index in turbulent diffusion 
flames from circular and elliptic nozzles. 


. Laminar partially premixed flames 

'he studies presented here on this topic are taken from the works of Kolluri (1992), Rao 
1994), and Subba (1995). The natural gas furnaces of residential and commercial heating 
ystems employ partially premixed laminar flame burners. These burners are operated 
/ith low supply pressures, typically of the order of a few centimeters of water column, 
nd combustion occurs generally at atmospheric pressure. Hence, the designer’s choice of 
arying the operating parameters to control fuel-air mixing rates and pollutant emission 
haracteristics of these burners is severely limited. Motivated by the argument that air 
ntrainment into isothermal gas jets can be significantly altered by the passive approach 
f using the noncircular geometries for the nozzle exit, a research program in the author’s 
iboratory has been focused on changing the shapes of vaiious crucial components of 
n inshot gas burner of a residential natural gas furnace to noncircular geometry. The 
xhaust emission indices of NO, NO^, and CO, in-flame profiles of temperature and species 
oncentrations, air entrainment rate, and flow patterns have been studied for noncircular 
eometries (elliptic, triangular, and rectangular) of fuel nozzles, venturis, and burner exit 
orts. The discussion here is limited to only elliptic geometries of the burner exit port. 

.1 Burner apparatus 

'igure 8 shows the sketch of an inshot burner rated at 5.46 kW used in a natural gas- 
ired residential furnace. Experiments were conducted in the combustion chamber and 
ae instrumentation facilities described in § 3.1. The natural gas was supplied from the 
ab supply line and heating values were monitored on a daily basis. The fuel flow rate 
3 the burner was adjusted in order to keep the energy input rate to the burner constant, 
"he flow velocity and the area of the elliptic burners were maintained the same as those 
if the circular burner, and hence the observed changes in flame characteristics could be 
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Figure 8. Sketch of the inshot burner with standard and modified tips. 


attributed to only the shape of the burner exit port. Table 1 also contains the experimental 
conditions in this series of studies. 

4.2 Results 

Figure 9 shows the effects of elliptic geometry of the burner tips on the ambient air- 
entrainment into the nonbuming jets. Entrainment rate was detennined by measuring the 
concentration of oxygen at 15 Deq corresponding to the visible flame length. It is clear that 
the elliptic burner tips increase air-entrainment by as much as 30%. More interestingly, 
the entrainment change is not monototic with the aspect ratio of the burner tip. The shad¬ 
owgraph pictures of the flames (Subba 1995) with the standard circular and noncircular 
burner tips show that the flow emerging from the burner port has azimuthal instabilities 
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Figure 9. Air-entrainment into the par¬ 
tially premixed flames of circular and el¬ 
liptic tip burners. 
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Figure 10. Comparison of temperature 
profiles in the partially premixed flames of 
circular and elliptic tip burners. 


indicated by the axial streaks in both flames. However, the elliptic burner flame shows 
more numerous streaks confirming that the azimuthal instabilities are enhanced by the 
elliptic geometry of the burner. 

Figure 10 shows the temperature profiles in the near- and far-nozzle regions of the 
flames of the circular and the elliptic burners. It is clear that the trends of these profiles 
are strikingly similar indicating that there is no dramatic shift in the dominant physico¬ 
chemical processes due to the shape of the burner tips. The peak values of far-nozzle 
temperature in the elliptic burner flames, although not conspicuous in the figure because 
of the compressed scale, are lower than in the circular burner flames. This decrease can 
be attributed to the higher air-entrainment and lower soot production in the elliptic burner 
flames. In the near-nozzle region, the temperatures are not significantly different in the 
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Figure 11. Comparison of nitrogen ox¬ 
ide concentration profiles in the partially 
premixed flames of circular and elliptic tip 
burners. 
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Figure 12. Nitrogen oxides emission in¬ 
dex in the partially premixed flames of cir¬ 
cular and elliptic tip burners. 


two flames, presumably due to the fact that the increase in the oxidation rates of the fuel 
pyrolysis fragment species compensates for the increased dilution caused by the higher 
air-entrainment in that region. 

Figure 1 i shows the NO.t concentration profiles in the circular and the elliptic nozzle 
(aspect ratio 4) flames. Similar to the temperature profiles, the NOx concentration profiles 
also show double hump structure in the near-nozzle and midflame regions and either flat 
or single axial peak structure in the far-nozzle region. It is clear that interfacial combustion 
at the flame edges continues to be dominant until the midflame region. In conformity with 
the temperature profiles and entrainment results, the values of the peak NO^ concentration 
in the far-nozzle region are generally lower in the elliptic burner flame. 

Figure 12 shows the effect of burner shape on emission indices of NOx- It is clear that 
the elliptic shape E2 has a favorable effect on the pollutant emission indices of the burner, 
although the effect is less than about 15%. 


5. Concluding remarks 


Flow visualization studies of the stmcture of gas jets and flames have shown that the large 
scale coherent structures initiated by Kelvin-Helmholtz instabilities are responsible for 
the entrainment of surrounding fluid (Ho & Gutmark 1987). Also, it has been shown that 
noncircular geometries of the nozzles introduce many azimuthal instabilities which inter¬ 
act with these organized stmctures, and consequently, produce differential radial growth 
rates of jet and the consequent axis switching. Thus, air-entrainment rate and combustion 
c aracteristics of flames are affected by the noncircular shape of the burner, which has been 
demonstrated m the case of turbulent propane diffusion flames. Although the partially pre- 
ames were laminar, the results show that somewhat similar features exist. Since 
the burner exit conditions were such that the flames were not fully turbulent, the effects 
on en^inment and temperature field in this case seem to be smaller than in the turbulent 
ase. e ig er air entrainment into the near-nozzle region increases the oxidation of 

e pyro ysis s|^cies of the fuel and reduces soot formation rate. Further, the additional 
dilution also reduces peak temneratures and roncpniioni-U/ ___i*_ 
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Parallel power paths and compactness of gear transmissions 
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Abstract. Developments in mechanical power transmission have hinged 
around better materials and processes, new design ideas and effective new 
application of enduring scientific design principles and ideas. The current pre¬ 
sentation aims at showing how deliberate measures to optimize load distribution 
have increased the compactness and capabilities of present day gear transmis¬ 
sions. Exploitation of parallel power paths, packing of gears in annular spaces 
and the effective use of epicyclic action by reducing the number of gear meshes 
and substituting rolling friction are explained. 

Keywords. Mechanical power transmission; load distribution; parallel power 
paths; compact gear transmission. 


1. Introduction 

Transmissions serve primarily to convert and transmit motion and torque from a power 
source in a suitable manner for application. The source almost invariably provides uniform 
rotation. When it is converted into a non-uniform rotation, the device used is a non- 
uniform motion mechanism or a non-uniform transmission. Otherwise we have a uniform 
transmission or simply a transmission. 

While there is a great variety of basically different transmission systems including hy¬ 
drostatic transmissions and electric drives, we confine ourselves to purely mechanical 
systems here. In endeavouring to show how certain basic design principles play an im¬ 
portant role in the modem development of compact transmissions, we confine ourselves 
mostly to gearing and indeed to only parallel-axis arrangements. 


2. Basic design objectives 

In transmitting power through a fairly high reduction, compactness is a basic requirement 
of steadily increasing importance. High efficiency then becomes essential even for small 
power units as the heat dissipation problem becomes more acute. 
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There are many other objectives such as reliability, cost effectiveness, ease of mainte¬ 
nance, low noise levels, and ability to withstand critical environments. Keeping in view 
the basic aim of this presentation as stated in the introduetion, we confine ourselves to the 
question of compactness and problems connected with the same. 

3. Load, stress and strength 

Out of this triplet of basic factors forming the foundation for compaet designs, stress 
is decided mainly by form and size, while strength represents the role of material and 
treatments. Great care is taken to minimize the stress and maximize the strength. A very 
significant aspect of the load factor in realising the modern compact tranmissions is that of 
load distribution/equalization. This is however often lost sight of. In aehieving a satisfac¬ 
tory level of load distribution, both a carefully developed application of design principles 
and the availability of sophisticated manufacturing facilities have played a pivotal role. 

Design for effective load distribution thus becomes the main thrust of the cun'ent pre¬ 
sentation. We limit ourselves to spur and helieal gearing. 

4. Transverse distribution of load 

This refers to the distribution of tooth load between two or more pairs of gear teeth 
simultaneously in contact. Manufacturing accuracy (particularly in the form of low base 
pitch error) is naturally the first requirement for a good distribution of load. A high contact 
ratio (total contact ratio for helical gears) represents the theoretical occurrence of several 
simultaneous tooth-pair contacts. A large number of teeth, a large tooth height and a low 
pressure angle contribute to a high contact ratio. A large tooth height contributes to tip load 
leverage (for tooth root stress) but the tip load itself is reduced due to the high contact ratio. 
A more flexible tooth further helps in load distribution. A low pressure angle contributes to 
the basic all-load pitch-point contact stress but the pitch-point load itself may be reduced 
by the high contact ratio. The use of a large number of teeth on a given pitch cylinder 
is mainly limited by the bending strength since the tooth size is reduced. The tooth foiin 
however improves against the bending stress. Finally a large contact ratio generally reduces 
the noise level significantly. 

Optimization of the number of teeth, pressure angle, tooth height and tooth profile 
modifieations represents one way of improving the compactness of the basic spur gear 
pair. This qualitative discussion is to highlight the load distribution aspect. 

5. Longitudinal distribution of load 

This refers to the distribution of load along the length of the gear tooth. Manufacturing 
accuracy (particularly in the form of low tooth helix error) is naturally the first requirement 
for a good distribution of load. 

In terms of practical requirements, compactness consists not only of low volume of space 
occupied but also involves achieving a low radial dimension, i.e. a low centre distance. This 
involves the permissibility of a relatively large face width. A larger face width however 
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Figure 1. Straddle-mounted pinion situ¬ 
ated near the bearing at the non-torque end 
of the shaft. 


involves not only a possibly larger inaccuracy but also a larger bending and torsion of 
the pinion. Overhanging pinion is therefore avoided as far as possible and the pinion is 
straddle-mounted in bearings that are as close to the pinion as possible. The torsion effect 
on the pinion can be neutralised by the bending effect to some extent if the pinion is situated 
nearer the bearing at the non-torque end of the shaft. This is indicated in figure 1, 

In the case of multistage gearing there is the possibility of a pinion on an intermediate 
shaft receiving power from both the ends. This is illustrated in figure 2. Maldistribution 
due to the torsion effect can be minimized in this fashion. 

To improve the load distribution further, tooth helix modification has to be resorted to. 
This can be most advantageously used when the direction of major torque application is 
fixed and the magnitude of working load is fairly certain. The tooth helix correction may 
be made based on the most predominant load level or the average load. The correction for a 
symmetrically mounted pinion with one-sided torque application is illustrated in figure 3. 
In practice, an approximation by a helix angle correction or a single-sided crowning will 
often suffice. When both faces of a tooth are being coiTected, the middle one-third of the 
tooth length may be left uncorrected (i.e. straight for spur gears) or linearly corrected. This 
is to help tooth helix inspection, as shown in figure 4a. When only one side is corrected 
(e.g., lifting equipment), circular arc form crowning can be ground on that side as shown in 
figure 4b. The other side is left uncorrected for tooth helix inspection. The unsymmetrical 



Figure 2. Pinion on an intermediate shaft 
receiving power from both the ends in mul¬ 
tistage gearing. 






Figure 3. Illustration of correction for a symmetrically mounted pinion with one-sided 
torque application. 


crowning in figure 4b is the result of the approximate linear correction already referred 
to earlier and symmetrical crowning is introduced, to counter manufacturing inaccuracy. 
In case of doubt it is better to go in for simple tooth-end relief since over-correction can 
prove to be detrimental. 


6. Load distribution between different gear pairs: Parallel power paths 

Figure 5 shows dual trandem gearing, also called the locked train. There are now two 
parallel power paths as shown in figure 6. Maldistribution of load between the two power 
paths can result from tooth errors (especially pitch errors), errors in the relative angular 
position of gears (3' w.r.t. 2' as against 3 w.r.t. 2 in figure 5), errors in the circumferential 
location of the intermediate gears as planets around the sun gears (1 and 4 in figure 5) or 
errors in 5i and 54 in figure 5. 

Figure 7 shows a simple planetary gear drive with three simple planets. The three parallel 
power paths are also shown in the same figure. Compactness is achieved by packing the 
annular space between the sun gear 1 and internal gear 3 with several planets. Up to 6 planets 
are used, obtainable speed ratio being reduced as the number of planets is increased. The 


(a) 
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Figure 4. (a) Middle one-third of tooth 
length is left uncorrected for tooth helix in¬ 
spection when both tooth faces are being 
corrected, (b) Only one side is corrected 
and circular arc form crowning is ground 
on that side. 
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Figure 5. Dual tandem gearing, also 
called locked train. 


ompactness is however conditional on the achievement of an equitable distribution of 
Dad between the parallel power paths. 

There are basically three different ways of proceeding towards load equalisation amongst 
arallel power paths: 

(i) manufacturing accuracy, 
li) flexibility, and 

iii) kinematically correct design (statical determinacy). 

Some flexibility is always present, due to the elasticity of the machine elements. If there 
/ere no flexibility, only one power path would carry the entire load at any particular mo- 
lent, depending on the relative magnitudes of the tooth spacing errors involved. Flexibility 
nables the gaps in the other power paths to close when the total load reaches a certain 
ivel. The additional load is carried equally by all the power paths. 

Thus, with the same flexibility and ultimate load intensity, the lower the spacing errors, 
le more equitable is the distribution of load. Given the same spacing errors and the same 
Itimate load intensity, the higher the flexibility the more equitable is the load distribution 
mongst the parallel power paths. High flexibility, however, can cause vibration problems, 
eyond a point. It is therefore essential that a high degree of manufacturing accuracy be 
laintained, consistent with cost effectiveness. It may then be found, especially for small 
nits, that the flexibility already present in the system is enough. 

With a kinematically correct design, there is statical determinacy in the system. That 
5, the loads in the individual power paths are known (independent of the spacing errors 
/hich vary through a cycle and from unit to unit). The loads may not always be exactly 
qualized (this is further influenced by friction which is somewhat indeterminate). 
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Figure 7. Schematic of a simple planetary gear drive with three planets. The three parallel 
power paths are also shown. 


The standard kinematic solution to the load distribution problem for the three-planet 
drive of figure 7 is to ‘float’ the sun gear 1. This is normally achieved by a gear coupling 
between the motor and the sun gear as shown symbolically in figure 8. The coupling trans¬ 
mits a pure torque without imposing a lateral force on the sun gear 1. Ignoring friction, the 
three total tooth normal forces between the sun gear and the three planets are equalized 
when the planets are spaced at 120° on the carrier. This load equalization is greatly depen¬ 
dent on effective lubrication of the gear coupling. This lubrication can easily fail since the 
internal movements in the coupling engendered by the varying tooth spacing errors in the 
gearing proper are rather small. 

The sun gear centre moves in small loops, occasioned by the spacing errors. These move¬ 
ments can generate very significant inertia forces at high speeds with their dynamic effects 
on the meshes with the three planets. It is thus seen that a high degree of manufacturing 
accuracy is still essential. 

Instead of the sun gear, the carrier may be ‘floated’ by providing a gear coupling at the 
output. The torque at the carrier being higher, the gear coupling will be larger. 

Similarly, the internal gear may be floated. Coupling lubrication is tlien more involved 
since there is no rotation to help. Some proven high-power high-speed drives float both 
the sun and the internal gear. The inertia effects mentioned earlier are thus mitigated. 

Kinematically correct designs for more than three planets have also been developed. 
Figure 9 shows a design for a 5-planet drive schematically. Here the carrier C is floated by 
means of a gear coupling at the output (coupling not shown). The sun gear is not floated. 




M otor 
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Gear Coupling 


Figure 8. Symbolic representation of 
gear coupling between the motor and the 
sun gear. 
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Figure 9. Schematic design for a 5-planet 
drive. 



Auxiliary carriers C' and C" carry two planets each. The dimensions of the carriers are 
adjusted to provide as equal a distribution of load as possible amongst the planets and 
ensure static balance. The planets are equally spaced. 

The above and similar developments are based on the planar model. While theoretical 
studies have been made with the three-dimensional model, a practical approach to the third 
dimension is to use crowning on some of the gears. 

Kinematically correct designs for the dual tandem gearing of figure 5 are now considered. 
Out of a large number of design possibilities, some are indicated below. 

(i) Float the driving pinion (gear 1) by gear coupling. 

(ii) Mount the driving pinion (gear 1) on a long lever. 

(iii) Guide the bearing housing, in which the intermediate shafts are mounted linearly, 
perpendicular to centre distance. 

(iv) Use helical teeth for gears 1, 2, 2'. Make the two intermediate shafts 2-3 and 2'-3' 
axially free. Balance the axial forces on these two shafts by means of springs or by 
means of the fluid connection as shown in figure 10. 

(v) Introduce additional pinion V with opposite helix (vis-a-vis 1) to mesh with 2' and 
make the combined driving pinion 1-1' axially free. 

One of the guiding principles is to float only members with low inertia. 

Some of the flexibility solutions to the load distribution problem are now indicated. 
In the dual tandem gearing of figure 5, torsionally flexible couplings may be interposed 
between gears 2 and 3 and between gears 2' and 3'. Alternatively, the torsional flexibihty 
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Figure 11. Increase of torsional flexibi¬ 
lity of two intermediate shafts by means of 
the quill shaft design. 


of the two intermediate shafts may be increased by means of the quill shaft design of 
figure 11. In the 3-planet drive of figure 7, the internal gear may be mounted on flexible 
material (or equivalent designs in metal). 

7. Assembly conditions for parallel power paths 

Closely linked with the statical indeterminancy of the parallel power path arrangements is 
the problem of assembly of gears of the additional power paths beyond the first. In figure 5, 
gear meshes 1-2, 3^ and 1-2' can first be assembled without any problem. Gear 3' can 
then be assembled with gear 4 only at particular angular positions of 3' with respect to 2'. 
If one of these relative orientations is used, then the gears can be assembled. It is normally 
desired however to make units 2-3 and 2'-3' of identical specifications. The angles 5i and 
^4 may then be suitably chosen or the number of teeth adjusted. In the planetary gearing 
of figure 7, the sum of the number of teeth on the sun gear and on the internal gear should 
be divisible by the number of planets, if the planets are to be equally spaced. 

8. Compactness through epicyclic action 

When a fairly high ratio of speed reduction is required, use of single-stage gearing will 
make the wheel very large. Two, three or more stages are thus introduced. The space 
requirement however can still be too large. Two or three simple planetary stages may be 
used and the coaxial input and output exploited. The compactness is otherwise due to the 
multiple planets in the individual stages. 

In a simple epicyclic gear train consisting of two coaxial gear shafts a,b and a carrier 
shaft c, we can choose the tooth numbers such that the carrier-reference speed ratio fab is 
very near to -1-1, b being fixed. We can then obtain an epicyclic speed reduction ratio of 
fca = 1/(1 — /ab)- If hh — 1-02, the Speed reduction ratio from carrier c to co-axial gear a 
will be = 1/(1 — 1.02) = —50. Since the ratio iab is very nearly 1, the tooth numbers are 
small and the drive is very compact for the large ratio demanded. The efficiency however 
goes down severely. With a tooth power loss factor of0.005, the efficiency will be only 66% 
and becomes just 50% if the tooth power loss factor is 0.01. With a low efficiency, large 
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Figure 12. The ‘open’ train - reduction 
in the number of gear meshings from 2 to 
1 in a power path. 


leat is generated and the particularly compact unit is not equipped to dissipate the same. 
Po improve the efficiency, one step is to reduce the number of gear meshes in a power path 
fom 2 to 1. The result is the ‘open’ train shown in figure 12. Here the absolute rotation of 
he planet is the output motion. This is ‘brought down’ to the axis of the co-axial unit by 
neans of a constant velocity coupling using rollers. To obtain a large reduction, the number 
)f teeth on the planet and internal gear differ by a small number, creating the problem of 
ip-to-tip interference. Special involute tooth proportions are needed. Using rollers in place 
)f involute teeth on gear b and a tooth difference of 1, the tip-to-tip interference problem is 
nrcumvented by the rollers never leaving the mating planet. The demand for a low tooth 
)Ower loss factor is simultaneously satisfied. (Due to rolling friction as well as low roller 
/elocities; the latter are due to the internal-external mesh of almost -1-1 ratio.) This is the 
:yclo-drive system. Instead of the rollers being rotated about axes fixed on the ring gear 
mtemal gear), they may be housed in a separate cage and interposed between concave 
Dockets on the ring and the planet profile. This is the quadrant drive system. 
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Abstract. Vibration is an effective tool in detecting and diagnosing some 
of the incipient failures of machines and equipment. The present paper deals 
with the basic principles, which may help in identifying its diagnostic ability, 
the scope of its diagnostic capabilities, the instrumentation in vogue for its 
monitoring and the state-of-the-art of the monitoring techniques and programs. 
A few case studies are also given to illustrate how machine troubles/failures 
are diagnosed with the help of vibration signatures. 

Keywords. Vibration analysis; vibration signatures; machine diagnostics; 
condition monitoring. 


. Introduction 

?he subject of vibration generally deals with methods to determine the vibration character- 
stics of a system, its vibratory response to a given excitation and the means to reduce the 
ibration. Reference is however hardly made to its diagnostic ability. A vibration signature 
neasured on the external surface of a machine or a structure contains a good amount of 
nformation, which, if properly interpreted, can reveal the running condition of the ma- 
hine. It may be regarded as one of the languages through which a machine tells us of 
ts ailments. This information is useful in making suitable decisions regarding machine 
»r plant maintenance. Attempts have been made for the last two decades to exploit this 
eature of vibration signature for the benefit of machine condition monitoring, and the 
esult is a well proven technology now in vogue. This success has been possible due to the 
remendous developments made by specialists in the areas of instrumentation, electronics 
ind computers. Books by Collacott (1973, 1977) are often referred to, whenever a topic 
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out of maintenance, only when the condition of the equipment requires it as a safeguard 
from incipient failure. This strategy helps in reducing maintenance costs and chances of 
unscheduled breakdown or shutdown of the equipment. An improvement in productivity 
is inevitable. However, this requires sophisticated electronic instruments to monitor the 
condition of the equipment. Hence a knowledge of such instruments is essential. 

Before a brief description of such instruments is given, it is thought appropriate to list the 
diagnostic capabilities of a vibration signature vis-a-vis machine ailments and to enunciate 
the basic principles involved, which may help in exploring further such capabilities. The 
present paper provides a glimpse of the state-of-the-art of this diagnostic technique and 
the possible future ti'ends in its application to plant and machine maintenance. Some case 
studies are also included to illustrate how a fault could be diagnosed through vibration 
signatures. 


2. Basic principles involved 

The technique of condition monitoring through vibration is built up on the following 
principles 

(1) Any malfunctioning or deterioration in the operation of a machine component gives 
rise to increase in vibration level. 

(2) Vibration emanating from a component consists of certain frequencies depending upon 
its nature of operation. This frequency information does not get changed or lost during 
transmission of vibration, however, their vibration level may be attenuated. 

(3) Mixing of different vibrations does not cause any loss of the individual’s frequency 
information. 

(4) Every individual component or system has its own frequency, called its natural fre¬ 
quency, which changes only when the system parameters get affected. 

Figure 1 illustrates nicely some of the principles enunciated above. The unbalance 
in the coupling generates vibrations with frequency equal to the rotational speed, the 
bearing generates vibration of frequency depending on the number of balls and the gear 




Figure 1. Vibration generation and transmission. 
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lerates vibrations of tooth-meshing frequency. These vibrations mix with each other 
ing transmission, and the mixed signal is picked up by the transducer. The mixed 
nature has all the frequencies of the individual vibrations, i.e. no information with 
:ard to frequency is lost in mixing or transmission. 


Diagnostic capabilities of a vibration signature 

/ibration signature taken from an appropriate location in a machine can reveal the pres- 
;e of the following machine defects: Imbalance, misalignment, imperfect foundation, 
ohanical looseness, rubs, antifriction bearing defects, faults in belt drive, faults in gears, 
eve bearing looseness, oil whirl, blades/vanes defects, local resonances, etc. 

[n general, any machine defect, which alters the dynamic behaviour of the machine can 
detected through a vibration signature. 


Instrumentation and diagnostic parameters 

r carrying out vibration monitoring, suitable instrumentation is required and, depending 
the nature of the diagnosis, appropriate processing parameters, also called diagnos- 
parameters, should be identified. This section deals with these aspects of vibration 
mitoring. 

Selection of proper instrumentation for vibration monitoring is an important feature, 
vibration measuring system consists of a sensor or transducer, a signal processing unit 
i a display unit or recorder. The transducer is used to convert mechanical vibration 
o an electrical signal. The signal processing unit conditions, amplifies, integrates or 
ferentiates and analyses the signal for its frequency spectrum. Finally, the display or 
:order unit presents the vibration characteristics in an interpretable form. 

The three vibration transducers generally used have the following practical measurement 
iplitude ranges: 

• the proximity displacement transducer 100:1 

• the velocity pickup 1000; 1 

• the piezoelectric accelerometer 30 x 10^: 1 

us, from this angle, the piezoelectric accelerometer should be the best choice. Further, 
is lighter and has better frequency range for application. These days accelerometers 
^ used as sensors in most vibration monitoring systems. Some instrument manufacturing 
ns, particularly Briiel & Kjaer of Denmark, have developed a well-perfected technology 

• vibration monitoring with these sensors. 

Proximity displacement transducers are generally used in case of rotating machines hav- 
I their rotors mounted on oil bearings. Displacement probes, generally two in number at 
:h bearing support, monitor the orbital motion of the rotor. It is the shape and nature of 
jse orbital paths which provide the clues for impending defects in the rotor. Defects like 
balance, misalignment, oil/shaft whirl and rotor cracks are easily detected. M/s Bently, 
wada, USA are the pioneers in developing a technology with eddy current probes as trans- 
cers. The turbo-generator sets of many powerplants are equipped with these instruments. 


Table 1. Time-based vibration analysis (Tranter 1989). 


Overall level (r.m.s.) measurements Most common vibration measurement in use. Most simple and 

inexpensive type of measurement. Greatest limitation is the 
lack of sensitivity and information available in the data. Unless 
a problem is severe, r.m.s. may not change significantly. 

Peak level detection Particularly useful for monitoring the change in the amount of 

impulsiveness, possibly due to increased bearing damage. This 
method is not 100% reliable, as other effects can also increase 
the peak level of a signal. 

Crest factor The crest factor (sometimes called the impact index) is the 

ratio of the peak level to the r.m.s. level. This method also has 
limitations. 

Shock pulse & spike energy Basically a measure of the vibration level at the bearing reso¬ 

nance, usually above 30 kHz. Widely used, however, concern 
has been expressed as the reading can decrease in later stages 
due to a reduction in impulsiveness, and other conditions, such 
as turbulence and cavitation in pumps, can give false readings. 

Kurtosis Statistical parameter, derived from the statistical moments of 

the probability density function of the vibration signal. The 
Kurtosis technique has the major advantage that the calculated 
value is independent of load or speed variations. 

Demodulation (envelope detection) Often the bearing signals are swamped by more dominant low 

frequency signals. This method, which can be implemented as 
a Hilbert transform, filters out low frequency signals, leaving a 
clean signal dominated by the bearing frequencies. 

Phase Phase indicates the relative timing between two points. It is 

used in balancing and is useful when diagnosing imbalance, 
misalignment, looseness and other cases. 

Time waveform Using an oscilloscope, it is possible to view the waveform of 

the vibration. Difficult to use in isolation, it can be very helpful 
tool in combination with others. 

Orbits Taken using two channel oscilloscope connected to proxim¬ 

ity probes. More recently, they have been derived from a pair 
of frequency spectra. The major benefit is that they show the 
relative motion of the dominant vibration of the shaft. 


Choice of accelerometer is not restricted to measurement in the acceleration mode only, 
because an acceleration signal can be converted into velocity or displacement with no loss 
of accuracy. The best mode to be used for measurement is the one that gives a spectrum 
with little or no slope, as this takes up the least dynamic range in a measuring system. It 
is found that velocity is the best mode. 

A vibration signature, as obtained by the accelerometer or any other transducer, is a 
time-base signal which may be processed for its overall level, peak level, phase, spike 
energy etc. Some of the diagnostic indicators can be evaluated by making time-based 
analyses. Table 1 summarizes these indicators, which can be adopted as useful parameters 
in trend analysis. 

A vibration signature may be a combination of various harmonics, which is obtained by 
processing the signal through a frequency analyzer. Each part of the frequency is a potential 
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Table 2. Common faults and frequencies (Tranter 1989). 


Frequency 

Possible cause 

Comments 

lx rpm 

Imbalance 

Misalignment 
or bent shaft 
Strain 

Looseness 

Resonance 

Electrical 

Steady phase that follows transducer. Can be caused by load 
variation, material build-up or pump cavitation 

High axial levels, 180° axial phase relation at the shaft ends. Usually 
characterized by high 2 x rpm 

Caused by casing or foundation distortion, or from attached 
structures (e.g. piping) 

Directional changes with transducer location. Usually high harmonic 
content and random phase. 

Drops off sharply with change in speed. From attached structures 
Broken rotor bar in induction motor. 2 x slip frequency sidebands 
often produced 

2x rpm 

Misalignment 
or bent shaft 

High levels of axial vibration 

Harmonics 

Looseness 

Rubs 

Impulsive or truncated waveform; large number of harmonics 

Shaft contacting machine housing 

Sub-rpm 

Oil whirl 
Bearing cage 

Typically 0.43-0.48 of rpm; unstable phase 

Fundamental train = 0.5 x rps [l-(ball dia)/(pitch dial) x COS 
(contact angle)] 

Nx rpm 

Rolling 

element 

bearings 

Gears 

Belts 

Blades/vanes 

Inner race = 0.5 x # balls x rps [l-f (ball dia)/(Pitch dia) x COS 
(contact angle)] 

Outer race = 0.5 x # balls x ips [l-(ball dia)/(pitch dia) x COS 
(contact angle)] 

Ball defect =0.5x (pitch dia)/(ball dia) x rps [l-(ball dia)/(pitch 
dia) X COS (contact angle)^] 

Usually modulated by running speed 

Gearmesh (# teeth x rpm); usually modulated by running speed 

Belt X running speed and 2x running speed. 

# Blades/vanes x rpm; usually present in normal machine. 
Harmonics usually indicate that a problem exists 

Nx powerline 

Resonance 

Electrical 

Shorted stator; broken or eccentric rotor 

Several sources, including shaft, casing, foundation and attached 
structures. Frequency is proportional to stiffness and inversely 
proportional to mass. Run-up tests and modal analysis are useful 
in this area 


indicator of the machine condition. Spectrum analysis is the most powerful technique 
for diagnostic study. The underlying principle is that each operating component of the 
machine generates identifiable frequencies. Thus, changes in the vibration level at a given 
frequency can be related directly to the concerned machine components. Based on the basic 
knowledge associated with the nature of machine operation and the principles enunciated 
in § 2, one should be able to calculate the frequencies consequent to the impending faults. 
In the literature, charts supplying this information are available. Table 2 is one such chart 
giving a summary of fault frequencies and the relevant faults (Tranter 1989). 

Sometimes, a change in the individual frequency component does not bring about any 




that the bearing condition was steadily deteriorating. In this particular case, balls and races 
were found badly spalled and the cage was fractured at two places. 

There are a number of indices derived from the frequency spectrum. They are very useful 
parameters for trend analysis. Table 3 lists these parameters and describes their diagnostic 
characteristics. 

For trend analysis to be performed, a reference base signature of the machine in good 
condition is needed. To ascertain whether the machine is in good condition, three or four 
machine signatures are taken at weekly intervals under the same operating condition. If 
no change is noted in the vibration spectrum, tbe machine’s internal condition must be 
reasonably stable and the machine can be considered in good health. The first spectrum 
can be used as a base line spectmm for trend analysis. 

Trend monitoring strategy is adopted where it becomes difficult to generate acceptable 
vibration levels. The underlying idea in trend monitoring is that the rate of deterioration in 
vibration level of the machine increases rapidly near breakdown. The vibration signature 
is recorded regularly, and the values for the chosen indices as listed in tables 1 and 3 are 
detenmned, and the trend noted. The lead time before the degradation reaches breakdown 
is one of the main advantages of the trend monitoring. During the lead period maintenance 
actions are planned. 


5. Vibration monitoring programmes 

The monitoring programmes can be classified into three levels of sophistication. This is 
reflected in the speed with which they can detect the faults and provide information for 
locating them. 

The simplest system consists of an accelerometer and a vibration meter (figure 3 a). 
It measures the vibration level over a specific wide-frequency range. Measurements are 
compared with the established reference values for each machine. If such reference values 
are not generated for each machine, they can be taken from the general standards like 




Vibration - A tool for machine diagnostics and condition monitoring 399 


ble 3. Frequency based vibration analysis (Tranter 1989). 

ectrum Derived from the vibration waveform by performing a ‘fast 

Fourier transform’ (FFT). Given that the running speed of the 
machine is directly proportional to the frequency measured, 
it is possible to relate peaks in the spectrum to the machine 
components 

iterfall plot A waterfall plot (also known as spectral map and cascade plot) 

is a three-dimensional representation of spectra, usually with 
time as the third dimension 

pstrum The FFT of the logarithm of the power spectrum. Used to high¬ 

light periodicities in the spectrum. Useful in bearing and gear¬ 
box analysis 

fference spectra By mathematically subtracting two spectra, changes in level 

are easily identified. Fault frequency analysis is performed to 
relate the frequencies to the machine components. Does not 
cope well with running speed changes 

) 0 t mean square of difference The rms of the difference between the current spectrum and the 

baseline, and the current spectrum and the previous spectrum, 
have both been found to be useful trending parameters 

atched filter Another method of trending the difference between vibration 

spectra. Found to be a reliable trending parameter. Quantifies 
the differences between spectra by summing the squares of the 
corresponding amplitude ratios in the spectra and taking the 
logarithm of the result 


DI 2056. This is the cheapest system and the operator does not require much training, 
achine condition is evaluated in the field from minimum data. The system however has 
me limitation. If the change in vibration level is from more than one source, the method 
nnot diagnose the actual source of trouble. For diagnosing the source of trouble some 
struments are required to perform the time-based analysis to obtain the parameters, as 
entioned in table 1. Some of these facilities are provided in the vibration meter. In the 
ccific case of rolling element bearings, and where vibrations from other sources do not 
)minate, early warning of bearing deterioration can be obtained. The meter is switched to 
ensure simultaneously both the peak and rms vibration levels up to 10 kHz. At the earlier 
age of fault development in the bearings, very little change in the rms level is noted, 
though the peak level increases significantly. By evaluating the crest factor (peak/rms 
tio) many rolling element bearing faults can be detected at an earlier stage. 

Fault detection at an earlier stage together with diagnosis is also possible, when using 
system which can perform frequency analysis. Full frequency analysis and spectrum 
ot-out are done on the spot for each monitoring point. The system is shown in figure 3b. 
Lirrent spectra are compared manually with the recorded reference spectra to reveal tell- 
le increases in the level of individual frequency components. Reference (baseline) spectra 
e recorded for each monitoring point and transferred to transparent sheets. Subsequent 
lectra are placed under the reference sheets, where any difference will be immediately 
)parent. As the levels of certain frequency components begin to grow, they are plotted 
1 a level versus time chart, so that failure trends can be predicted. This enables the fault 
be diagnosed, spare parts to be ordered and repair to be scheduled conveniently. Some 





Tape recoroer fft analvser computer 


Figure 3. Vibration monitoring systems, (a) Simple system for vibration measurement; (b) 
system for frequency analysis; (c) system for computer-aided off-line trend monitoring. 


users perform simple wide band monitoring on a regular basis and employ the frequency 
analyzer when sufficient changes in the vibration level are noted. While analysis at this 
stage will aid diagnosis of the developing fault, the early warning and trend benefits of 
regular analysis are not available in this case. 

As the number of monitoring points increases, a computer-aided spectrum comparison 
system will be the most economical solution. Vibration signatures from each machine 
are collected on an instmmentation tape recorder. They are analyzed on a real-time fre¬ 
quency analyzer in the office, and the current spectra are compared with the baseline 
spectra under semi-automatic control from a desk-top computer coupled to the analyzer 
(figure 3c). Advanced programmes aid fault diagnosis and trend monitoring. Narrow-band 
frequency analysis on a linear frequency scale as provided by the FFT analyzer gives 
excellent display of harmonic and side-band frequency components, a particularly valu¬ 
able information for diagnostic purposes. Where the analyzer includes a zoom facility, 
any part of the spectrum can be expanded to further enhance the details of individual 
components. 



6. Permanent monitoring system , 

Such a vibration monitoring system is permanently installed on a specific machine and 
continuously watches its condition. Its function is to give immediate warning of any sud- / 

den changes in the condition of expensive non-duplicated machinery, whose continuous 
operation is vital to the production process. Faulty conditions are detected immediately or j 

within minutes of occurrence and trigger alert or alarm signals in the plant control room, so 
that appropriate measures can be taken before a catastrophic failure occurs. These systems ■ 

are widely used in the power generation and petrochemical industries on turbines, feed 
pumps, gas compressors etc. A typical system is shown in figure 4. ^ 

In a basic system, a single module may continuously monitor vibration over a single ^ 

specified frequency range. If preset limits are breached (e.g. minimum, alert and alarm), 
the system can trigger visual or audible alarms. Alternative options can provide for up 
, to three individual frequency bands to be monitored simultaneously. Coverage of many 
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A - ALARM 
T - TRIP 

D - LEVEL DETECTOR 

Figure 4. Permanent monitoring system. 

nonitoring points can be provided by a multiplexer connected to the input of a single mon- 
toring module. A multiplexer continuously steps through the chosen channels, dwelling at 
5ach channel for a preselected period before automatically moving on to the next. Several 
nultiplexers can be connected to obtain the economic multichannel systems. Spectrum 
malyzers can be coupled to the monitoring system when fault diagnosis is required and, 
n the most sophisticated systems, computer-controlled fault detection is applied automat- 
cally to all monitoring points. 

Main requirements for all permanent monitoring systems are extremely high opera- 
ional reliability, long-term stability and immunity to adverse environmental conditions and 
rregularities which can cause false alarms. To safeguard from false alarm, these systems 
ire sometimes provided with an automatic test system, so that the plant operator can imme- 
liately check whether the instrumentation is functioning correctly in the event of an alarm. 

7. Future trends in vibration monitoring and fault diagnosis 

Future trends, which are visible in the present day practice, are given below. 

7.1 Permanently installed monitoring devices 

Machines and equipment with built-in monitoring devices are increasing day by day. 
Generating sets, steam and gas turbines, special purpose machine tools and production 
machines, refinery equipment are just some of the examples. It is foreseen that through 
increased and improved vendor relations in the near future, critical equipment will have 
installed monitors at the insistence of the customer or through the intention of the vendor. 
This trend is surely going to result in better reliability, availability and maintainability of 
the machinery. 










manually. Further, it is true that the use of a number of parameters and techniques in 
combination will give the best indication of machine condition. This means that large 
amounts of data must be collected, analyzed and interpreted. Computers can help in a 
big way to manage such huge amounts of data and store useful information after data 
reduction. This information can be retrieved in no time for trend analysis. Use of computers 
is gradually increasing. In some cases, monitored data are manually fed to the computer, 
while in others it is automatically done with the use of multiplexers and A/D interfacing. 
The computer gives out alert reports, trend graphs and diagnostic information. 

7.3 Expert systems 

Fault tracing on the basis of monitored and analysed data is not always straightforward. It 
needs guidance from the experts who, based on their knowledge and experience, are able 
to pin-point the source of trouble. An expert, besides possessing knowledge of the subject, 
also has the ability to think in a progressively logical manner, which is a positive advantage 
in the world of problem-solving associated with plant and machinery faults. Perhaps this 
is where the area of ‘artificial intelligence’ has something to offer. 

An expert system consists of a knowledge-base, an inference engine and a user interface. 
The knowledge-base contains a set of rules and facts that an expert system uses to solve a 
particular problem. The mles describe the relationship between the machinery problems 
and the corresponding symptoms. The inference engine contains control mechanisms to 
control the operation of the system and to infer information from the knowledge-base 
by using the mles to reason about the facts. The user interface enables communication 
between the user and the computer. 


8. Case studies 

Three case studies are presented to show how the vibration signature could help in diag¬ 
nosing the source of troubles. 

8.1 Dryer foundation vibrations 

In this case study (Gupta et al 1986), the problem was of excessive vibrations at the 
foundation block near the discharge end, and the objective was to determine the cause of 
these vibrations. 

The dryer consists of a shell of 20 m length and 2.5 m diameter. It is supported on two 
foundation blocks, 12 m apart, through two pairs of rollers. A schematic view of the shell 
on the foundation block nearer to the discharge end is shown in figure 5. The dryer is 
meant for drying polypropylene (PP) granules. It is driven at a speed of 3.5 rpm by an 
electric motor through reduction gears and a pinion. The complete drive unit is mounted 
on the foundation block nearer to the intake end of the dryer. The PP granules passing 
through the dryer get dried by the steam passing through a number of tubes arranged along 
the shell length and near its circumference. A number of hammers are arranged along the 
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Figure 5. Schematic of the sectional view of the dryer on foundation and location of 
measuring points. 


ircumference of the shell and near the intake end. These hammers fall on the shell surface 
nder gravity during shell rotation and thus tapping the shell surface, so that PP material 
tiould not stick to the surface and choke the passage. 

Vibration measurements were conducted on the foundation block-. A time-base signal 
s recorded is shown in figure 6, and the corresponding frequency spectrum is shown in 
gure 7. The time-base record corresponded to a narrow-band signal showing the dom- 
iiance of 6 to 7 Hz vibration. Of course, the amplitudes varied in random fashion. The 
requency spectrum also confirmed the presence of 5.2,6.0 and 6.5 Hz harmonics. A simple 
tieoretical analysis revealed that these frequencies were nothing but the natural frequen- 
ies of the foundation block in three modes. Further, it was noticed in the time-base record 
tiat there was repetition of the similar pattern of the signal after every 4.0 s, corresponding 
0 a frequency of 0.25 Hz, which happened to be the rotational speed of the roller. Zones 
if these repetition are marked in figure 6. Hence, it may be inferred that some irregularity 
in the roller surface caused this repeated phenomenon. 

From the foregoing discussion it was concluded that the foundation block was getting 
ixcited in its natural modes. This was possible if it was excited by a wide-band process 
kt-i r* rk -m Q c c 1 n rr frAmnanripc nf nntnrfll modes. Tt was realized that the wide-band 
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Figure 6. Time-based record of the vibration at the foundation. 



Figure 7. Frequency spectrum of the foundation vibration. 
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These pits might have been formed in course of time due to excessive stresses and lack of 
lubrication. The pits were of different sizes and spread randomly over the surfaces. During 
operation of the dryer, if pits of minute sizes fell on the track of contact, the foundation 
had low level vibration, while in case of big pits, there was excessive vibration. 

The following recommendation was made. 

“Pit formations are because of excessive contact stresses and lack of lubrication. 
Contact stresses can be reduced by increasing the area of contact. Sufficient lubri¬ 
cation can be provided by dipping the rollers in the oil bath, so that their surfaces 
are well smeared with oil before coming in contact with the tyre. If pit fonna- 
tions still persist, they should be repaired during the shut down period by pouring 
welding material and then grinding the spots.” 

The company did adopt the above measures and did not face any more problem of 
excessive vibrations at the foundation block. 

8.2 Excessive vibrations of the piping connecting booster pump (BP) and boiler feed 
pump (BFP) (Gupta et al 1988) 

While commissioning a super thermal power unit (500 MW) at a certain power station, 
vibrations of very large amplitude were observed in the piping connecting the BP and the 
BFP under certain conditions of operation, viz. safety valve blow-out operation. The region 
of these severe vibrations is marked in the characteristic curve of the BFP (figure 8). We 
were required to detect the cause of the trouble and also suggest a possible solution to the 
problem. 

Figure 9 shows the relative positions of BP, BFP and the piping layout. Both BP and 
BFP are installed on the ground floor of the Power House, while the deaerator tank (DT) 
is situated at a height of about 26.5 m, thus providing a positive suction head of 30.5 m 
of liquid column to the BP. The water from DT goes to the BP through a barrel-type 
strainer having a mesh size of 0.2 mm. The BP is driven by an electric motor at a constant 
speed of 1493 rpm. The discharge from the BP is connected to the BFP through another 
barrel-type strainer with a mesh size of 0.5 mm. The BFP is driven by the same electric 
motor through gears and hydraulic coupling. Its speed can be controlled from 1490 to 
5905 rpm by changing the position of the scoop in the hydraulic coupling. The discharge 
pipe of the BFP is mounted with a recirculation valve, which directs the excess flow to the 
recirculation piping leading to the DT. 

We decided to record the vibrations of the piping at suitable locations under conditions 
of severe and non-severe vibration and to analyse these records in the laboratory for the 
frequency contents. We thought that a comparison of their vibration spectra would provide 
a clue to the identification of the source of trouble. 

Figure 9 shows the four locations at which vibration signatures were recorded in the 
three directions. Signatures were also recorded at the BP suction pressure tapping (location 
5 in figure 9), in order to have an idea of the frequencies of pressure pulsations in the water. 
Vibration measurements were carried out on two occasions. On both occasions, so-called 
severe vibration conditions could not be achieved. Vibration signatures were analvsed for 
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Figure 8. Motor-driven BFP performance curve on cold water (s.g. 1.0). 

Further, the natural frequencies of the piping structure were determined by the rap test, 
and were in the range of 2.5 to 3.0Hz. 

It was concluded that the severe vibrations of the interconnecting piping were due to 
resonance. Based on the experimental results and also on the experience of other research 
workers on centrifugal pumps as used in powerplants, the possibility of the following 
causes is mooted. A logical discussion is also included while deliberating on these causes. 

(i) Presence of pressure pulsations even at non-severe conditions of operation has been 
identified but did not cause large piping vibrations because of non-coincidence of 
the excitation frequencies with the natural frequencies of the piping. Pressure pulsa¬ 
tions are as a result of internal recirculation. It is well-known that centrigugal pumps 
are prone to internal recirculation at reduced flows (compared to rated flow). Conse¬ 
quences of internal recirculation are noisy operation and pressure pulsations (hydraulic 
surges) of low frequencies (1-8 Hz) causing pipeline vibration. It was noted that with 
increase in flow rate, pressure pulsation intensity was reduced, thus confirming the 
above reason for pressure pulsations. Pressure pulsations due to internal recirculation 
can be avoided by. 
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Figure 9. Booster pump, BFP and piping layout with measuring points location. 


(a) reduction of pump inlet eye area by providing suitable inserts; 

(b) running the pump at the rated flow capacity so as not to result in internal recircu¬ 
lation and deviating the excess flow at the delivery end through the recirculation 
pipeline, i.e. increasing the capacity of the recirculation valve and associated pip¬ 
ing at the outlet of the BFP. This solution has worked very well in a somewhat 
similar situation (Pattabiraman etal 1986). 

(ii) Referring to figure 8, severe vibrations of the piping have occurred at BFP discharge 
pressures of 176kg/cm^ and flow rates of 350 t/h onwards. These conditions are 
needed to make the safety valve operative but at the discharge pressure of 168 kg/cm^ 
piping vibrations are reported to be of low level. It seems that floating of the safety 


Table 4. Frequency components (in hertz) of vibration signatures. 


Location 1 


Location 2 

Location 3 

Location 4 

Location 5 

V 

T 

A 

V 

T 

A 

V 

T 

A 

H 

,V 

— 

- 

1.64 

— 

1.7 

- 

- 

- 

- 

- 

2.0 

1.9 

2.4 

2.4 

2.4 

_ 

2.4 

2.5 

2.9 

2.6 

2.6 

2.8 

- 

- 

3.4 

3.3 

3.3 

- 

- 

- 

3.4 

3.4 

- 

- 

- 

- 

5.7 

5.8 

5.8 

6.1 

6.0 

6.0 

5.8 

5.8 

6.0 

6.0 

- 

5.9 

6.9 

- 

7.1 

- 

- 

- 

- 

- 

7.3 

6.6 

- 

- 

8.1 

8.5 

8.5 

- 

- 

- 

8.5 

8.5 

8.5 

8.4 

- 

- 

9.5 

9.7 

9.5 

9.2 

9.2 

9.0 

9.4 

9.4 

9.7 

- 

9.4 

9.6 


Abbreviations: A - axial, H - horizontal, T - trans, V - vertical. 



column from boiler to pump and pump to deaerator tank causing resonance of the 
piping system. Simpson & Tramschek (1974) have conducted a study on pulsations in 
power station feed-pump systems and reported that fluid column frequencies can be 
as low as 1.5 Hz, and any low frequency excitation may set in pressure fluctuations in 
the column. 

8.3 Vibration problem at the non-driven end bearing of a generator of the gas based 
powerplant (Gupta & Yadava 1991) 

Very high vibration level of 2 x V frequency at the non-driven end (NDE) bearing of a 
generator was reported. It was required to identify the cause of this high vibration level. 

Vibration measurements were carried out at the DE and NDE bearings of the generator 
in all the three directions and also for two load settings, viz. 61 and 82 MW. This was done 
to see whether the electrical load had any influence on the vibration level. Spectral analysis 
was performed on the recorded signatures, and some useful information from this analysis 
is presented in table 5. 

In this table vibration levels in millimetres per second at frequencies 1 x V, 2 x V and 
4 X N {N being rotational speed of the rotor) are presented for loads of 61 and 82 MW. 
Dominance of the second harmonic vibration over the first harmonic vibration is clearly 
visible, particularly in the vertical direction at the NDE bearing, but the level is not affected 
much by the change in electrical load except at the NDE bearing in the horizontal direction. 
The frequency components which are greatly affected by the change of electrical load are 
underlined. These are the 4 x V component in the vertical direction at both the DE and 
NDE bearings, and the 2 x V component in the horizontal direction only at the NDE 
bearing. This information may help in tracing the cause of trouble by making use of the 
following knowledge-base. 

The generator is a two-pole machine, so any defect producing a change in air gap between 
the pole and stator leads to an unbalanced magnetic pull, whose intensity increases with 
the flux, which in turn varies directly with the load. The unbalanced magnetic pull occurs 
twice in one rotation of the rotor, and gives rise to vibrations of 2 x V frequency and 


Table 5. Vibration levels at frequencies 1 x iV, 2 x V and 4 x iV (iV = 3000 rpm). 


Location 

direction 


Vertical 


Axial 


Vibration level (mm/s) 


DE bearing load_ NDE bearing load 


Frequency 

61 MW 

82MW 

61 MW 

82 MW 

1 X N 

0.56 

0.62 

0.67 

0.68 

2xN 

3.55 

4.12 

8.42 

8.52 

4xN 

0.11 

0.27 

0.17 

0.94 

1 X N 

0.54 

0.55 

1.27 

1.20 

2xN 

3.13 

3.16 

2.99 

2.79 

4xN 

0.05 

0.07 

0.50 

0.59 

1 X N 

1.50 

1.51 

1.55 

1.76 

2xN 

5.44 

5.19 

0.13 

0.35 

Ax N 

0.09 

0.07 

0.30 

0.25 


Horizontal 
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' multiples of this frequency in the stator body or bearing support. The magnitude of the 
higher harmonics depends on the shape of the unbalanced magnetic pull. Further, if there 
are two defects located 45° from the horizontal plane and on either side of the vertical 
axis, the net dominant unbalanced pull will be in the vertical direction and its frequency is 
4 X N. The 4x N component in the horizontal direction gets moderated with the increase 
in load because of the opposing action. The enormous increase in the 2 x vibration level 
with the electrical load in the horizontal direction at NDE bearing suggests the presence 
of a defect in the stator at the level of its axis and near the NDE bearing. Similarly, the 
enormous increase in vibration level of the 4 x A/ harmonic in the vertical direction both at 
DE and NDE bearings suggests the presence of two defects located 45° from the horizontal 
plane on either side of the vertical axis and situated somewhere along the length of the 
stator. It may be noted that the 4 x A/ harmonic in the horizontal direction is somewhat 
moderated with the increase in electrical load. 

It was recommended that the generator be opened to inspect the stator for any defect 
causing a change in the air gap, and on doing this it was found that the cover strips near the 
NDE bearing were absent. These were put back and the problem of high 2x N vibrations 
was eliminated. 


9. Concluding remarks 

Vibration is a very effective tool for diagnosing many mechanical defects. It however 
needs expert knowledge and experience for fault-tracing. When vibrations are used for 
condition monitoring, it is always advisable to start with simple instmmentation and before 
applying the technique to plant or machinery maintenance, the economical viability must 
be assessed. It is envisaged that in the near future the use of computers in handling the 
large amounts of monitored data and reducing them to usable information formats will 
expand. The tendency to use expert systems for fault diagnosis will also increse. There 
will be attempts to develop expert systems dedicated to a class of specific plants and 
machinery. 
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3ii the analysis of time-periodic nonlinear dynamical 
;ystems 
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ing, Auburn University, AL 36849, USA 
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Abstract. In this study, a general technique for the analysis of time-period 
nonlinear dynamical systems is presented. The method is based on the fact that 
all quasilinear periodic systems can be replaced by similar systems whose linear 
parts are time invariant via the well-known Liapunov-Floquet (L-F) transfor¬ 
mation. A general procedure for the computation of L-F transformation in 
terms of Chebyshev polynomials is outlined. Once the transformation has been 
applied, a periodic orbit in original coordinates has a fixed point representation 
in the transformed coordinates. The stability and bifurcation analysis of the 
transformed equations are studied by employing the time-dependent normal 
form theory and time-dependent centre manifold reduction. For the two ex¬ 
amples considered, the three generic codimension-one bifurcations, viz, Hopf, 
flip and tangent, are analysed. The methodology is semi-analytic in nature and 
provides a quantitative measure of stability even under critical conditions. Un¬ 
like the perturbation or averaging techniques, this method is applicable even 
to those systems where the periodic term in the linear part does not contain a 
small parameter or a generating solution does not exist due to the absence of 
the time-invariant term in the linear part. 

Keywords. Nonlinear dynamical systems; Liapunov-Floquet transformation; 
centre manifold reduction; normal form theory; bifurcations; time-invariant 
forms. 


[. Introduction 

rhe study of systems governed by a set of ordinary differential equations is of great impor- 
ance in diverse branches of science and engineering. Numerous practical applications can 
)e found in the areas of quantum mechanics, systems and controls and dynamic stability of 
;tructures under oscillating loads. In particular, such problems arise in the dynamics of ro- 
atine svstems such as heliconter blades fBramwell 1976: Johnson 1980i and rotor-bearinn 
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nonlinear non-autonomous differential equations which contain explicit periodic functions 
of time. Periodic solutions of these equations physically represent steady-state operations 
under various conditions. The stability of these periodic solutions (or orbits) is determined 
by the equations of perturbed motion about the periodic solutions. In many situations, the 
linearized perturbed equations may be sufficient for the prediction of stability, and therefore 
the problem reduces to a set of linear ordinary differential equations with periodic coeffi¬ 
cients. The same mathematical problem also arises in the study of nonlinear autonomous 
systems when the stability of a particular periodic solution needs to be investigated. Be¬ 
sides the stability issues, the linear control problems associated with rotating systems can 
also lead to the same type of equations. For example, the Individual-Blade-Control (IBC) 
technique, used in the control of a helicopter rotor-blade system, produces a set of equa¬ 
tions with periodic coefficients (Kretz 1976; Mckillip 1985). Therefore the analysis of this 
special class of time-varying systems has been deemed extremely important. 

Hill’s method (Lindh & Likins 1970; Yakubovitch & Starzhinskii 1975), perturbation 
techniques (Stoker 1950; Nayfeh 1973) and Floquet’s theory (Coddington & Levinson 
1955; Floquet 1983) are some of the most commonly used mathematical methods in the 
analysis of such systems. It is well known that Hill’s approach is not suitable for digital 
implementation, especially if one has to deal with a large-scale system. The perturbation 
methods have their own limitations due to the fact that they can only be applied to systems 
where the periodic coefficients can be expressed in terms of a small parameter. Therefore, 
Floquet analysis coupled with a numerical integration code has served as the main tool 
in various applications (Peters & Hohenemser 1971; Friedmann et al 1977; Gaonkar et al 
1981; Sinha & Wu 1991; Wu & Sinha 1994). Floquet analysis is a powerful technique 
which can be easily implemented on a computer and holds most promise in the analysis 
of large-scale periodic systems. According to this technique, stability analysis requires the 
eigen-analysis of the ‘Floquet Transition Matrix’ (FTM) which is simply defined as the 
state transition matrix at the end of one period. Eigen-analysis and the evaluation of FTM 
are the two major computational problems encountered in the analysis of linear periodic 
systems of large dimensions. Since the FTM is nonsymmetric, the eigen-analysis problem 
is a difficult one. Some progress has been made in this direction (Cullum & Willoughby 
1986; Gaonkar & Peters 1986), however, the computation of FTM for large-scale sys¬ 
tems is still a challenging task. Most commonly a fourth or higher-order Runge-Kutta 
type numerical code has been used in a ‘single pass’ scheme for an efficient computation 
of the FTM (Friedmann et al 1977; Gaonkar et al 1981). Very recently, Sinha and asso¬ 
ciates (Sinha & Wu 1991; Joseph et al 1993; Sinha et al 1993; Wu & Sinha 1994) have 
developed a new technique for the analysis of large-scale periodic systems. It has been 
shown that the proposed technique is numerically several times faster than the standard 
codes and at the same time it can also be applied in the symbolic form (Sinha & Juneja 
1991). 

Although the linearized equations play an important role in the stability analyses, they 
fail to provide answers to many questions associated with the nonlinear periodic systems. 
Questions such as how does the solution depend on a control parameter? What kind of 
motion takes place after the loss of stability of the periodic state? Is it possible to identify the 
types of stability loss and critical values of the control parameters in each case etc? In order 
to answer such questions one must investigate the nonlinear equations of perturbed motion 
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including a bifurcation analysis of the periodic orbit. In many instances certain methods of 
nonlinear dynamics can be applied to obtain information of significant value. It is known 
(Arnold 1988) that the simple-st loss of stability of a periodic orbit constitutes a degenerate 
problem of codimension 1. However, if a pair of complex multipliers simultaneously 
cross the unit circle of the complex plane, then we have the Hopf bifurcation of periodic 
orbits. The details are given by Guckenheimer & Holmes (1983) and Arnold (1988). For 
the case of a codimension 2 bifurcating periodic orbit, some qualitative mathematical 
results have also be obtained (Chow & Wang 1985). Qualitative analyses certainly provide 
good insight into the problems, but for engineering applications quantitative methods are 
indispensable. 

Perturbation and averaging methods (Bogoliubov & Mitropolsky 1961; Nayfeh 1973; 
Sanders & Verhulst 1985) are suitable for relatively smaller systems and in general, their 
applications are limited to systems where the periodic terms as well as the nonlinearities 
can be expressed in terms of a suitable small parameter. Hopf bifurcation of Duffing’s os¬ 
cillator and nonlinear Mathieu’s equations are discussed by Awreicewicz (1989) through 
an application of perturbation and harmonic balance methods. On the other hand, one can 
apply standard numerical techniques associated with boundary value problems to analyse 
the situation. These techniques are basically shooting methods and provide strategies for 
calculating branch points and new branches of bifurcating solutions. These methods have 
been exploited by several authors (Seydel 1981, 1987,1988; Doedel & Kemevez 1986). 
Shooting methods are quite reliable but they are certainly not free from numerical insta¬ 
bility difficulties. At the same time, when a system is non-autonomous, the trajectories 
can cross themselves and it is difficult to obtain a general stmcture of the motion through 
a purely numerical scheme. An attractive alternate method of analysis is provided by the 
technique called ‘point mapping’. The idea was introduced by Poincare (1899) and later 
developed by Birkhoff (1966), Arnold (1988) and Bemussou (1977). In this approach 
the continuous-time periodic system is reformulated as discrete-time events by defining 
a point mapping called the Poincare map. Thus the original non-autonomous differential 
system is replaced by a set of difference equations which do not explicitly depend on 
time. In principle these are easier to analyse and simulate on a digital computer. However, 
one faces serious computational difficulties in application of this technique to real engi¬ 
neering problems even if the dimensions are small. In order to obtain the corresponding 
difference equations, one must construct an exact or approximate solution of a system of 
nonlinear differential equations. Exact solutions are only possible in very special cases, 
such as those of impulsive excitation problems discussed by Hsu and his associates (Hsu 
& Cheng 1973, 1974; Flashner & Hsu 1983; Hsu 1987). Since one must settle for an 
approximate representation of the point mapping, recent studies (Lukes 1982; Flashner & 
Hsu 1983; Guttalu & Flashner 1989,1990) have suggested the use of Runge-Kutta type 
algorithm and perturbation technique for obtaining a truncated version of the Poincare 
map. Following this approach one can discuss the bifurcation of periodic solutions to 
other possible periodic motions or to quasiperiodic and aperiodic solutions (Lindtner et al 
1990). 

One other viable approach is to use the Liapunov-Floquet theorem which allows trans¬ 
formation of the quasilinear periodic systems into a new set of similar equations, whose lin¬ 
ear parts are time-invariant. However, it is not a simple task to compute this transformation 
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matrix for a general periodic system. For certain special class of linear systems, it is possi¬ 
ble to obtain the Liapunov-Floquet transformations as indicated by Lukes (1982). In order 
to determine such a transformation for a general periodic system, one must compute the 
STM as an explicit function of time. Recently we (Pandiyan et al 1993; Sinha & Joseph 
1994; Sinha & Pandiyan 1994; Pandiyan & Sinha 1995) have been successful in developing 
a computational procedure through which the Liapunov-Floquet (L-F) transformation can 
be obtained in terms of Chebyshev polynomials which is suitable for algebraic manipula¬ 
tions. The inverse of the L-F transformation can also be computed by a similar procedure 
considering the adjoint system equation. 

The development of a procedure for computing these transformation matrices has given 
a clear edge in dealing with a wide range of problems associated with periodically vary¬ 
ing systems. In this paper, a quantitative analysis of nonlinear dynamical systems with 
periodic coefficients has been presented through an application of the Liapunov-Floquet 
(L-F) transformation. It is shown that the original quasilinear periodic system can be trans¬ 
formed to a dynamically similar form in which the linear part is time-invariant. The anal¬ 
ysis of the ti-ansformed equations has been carried out through the use of time-dependent 
normal form theory. The solutions thus obtained are mapped back to the original coor¬ 
dinates by applying the inverse L-F transformations and compared with the numerical 
results obtained by a Runge-Kutta type algorithm. The method is also applicable to sys¬ 
tems undergoing bifurcations. Such problems are referred to as ‘critical cases’ and have 
been studied through an application of the centre manifold theory (Malkin 1962; Carr 
1981). For brevity, only codimension I bifurcations are considered. In order to demon¬ 
strate the effectiveness of the proposed analysis procedure, two examples have been studied 
in detail. The first example consists of a nonlinear Mathieu equation, the L-F transfor¬ 
mation of which has been computed using the Chebyshev polynomials as described in 
§ 3. The solutions of this example have been obtained in stable and centre manifolds for 
some typical sets of system parameters. It has been shown that the proposed technique 
is applicable to a wide class of problems including the situations where the generating 
solutions do not exist and/or the parameter multiplying the linear periodic terms are no 
longer small. It is also shown that in many cases it is possible to obtain approximate 
analytical solutions which compare extremely well with the numerical solutions. The 
results obtained by the traditional averaging method are also presented for comparison 
purposes. 

As a second example, the bifurcation problem of a double inverted pendulum subjected to 
periodic loading is selected. The Hopf bifurcation in a double inverted pendulum subjected 
to a tangential static load has been studied by Sethna & Shapiro (1977) and thereafter 
many researchers have contributed on various bifurcation aspects of such an autonomous 
system. However, when the double pendulum is subjected to a periodic load, the system 
becomes non-autonomous. Periodic bifurcations of such a pendulum has been reported 
by Flashner & Hsu (1983) by the method of point mappings. In this paper, the dynamics 
of this four-dimensional system undergoing a single Hopf bifurcation or a single flip 
bifurcation is investigated in a two-dimensional centre manifold or a single-dimensional 
centre manifold, respectively by applying the time-dependent normal form theory and 
centre mamfold reduction. The results of such analyses are verified by using numerical 
simulations. 
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2. Background 

2.1 Mathematical structure of periodic systems 


In general, many problems of mechanical systems can be reduced to a set of nonlinear 
ordinary differential equations of the form, 

y = g(3^,^,0, (1) 

where y is an n state vector, g() is a continuous nonlinear function of y and t and X is a 
set of control parameters. 

Let y (0 denote a known periodic solution (of period L) of (1) such that the perturbation 
x(/) about the periodic motion y(t) can be defined as 

y(0 = y(0+-^(0- (2) 

Substituting (2) in (1) and expanding the right-hand side in Taylor series about y(t) 
yields 


x/(0 = 


dyj 


+ r 


1 d^gi 


Ht) 


2 dyjdyk 


XjXk + 


1 




m 


3! dyjdykdyi 


XjXkXi 4 - 


yh)' 


(3) 


The above equation may be rewritten as 

x = A(r, X)x 4 -f 2 (x, t, A) 4-f 3 (x, t,X)-\ - t, X) 4- 0(|x|^+\ t) 

(4a) 

or 

X = A(t, X)xF(x, t, X) (4b) 

where f/i:( ) contain homogeneous monomials in xi of order k. A(t,X),fk{x,t, X) and 
F(x,t,X) (suitably defined in terms of fif )’s) are T periodic functions. The linear part of 
(4) is, of course, given by 

X = A(t, A,)x; A(f, X) = A(f 4- T, X). (5) 

The stability and response of (5) can be discussed using the well-known Floquet theory. 

We are interested in the analysis of (4) and determine its behaviour as the control 
parameter X varies. 

2.2 Results from Floquet theory (Floquet 1983; Coddington & Levinson 1955) 

Theorem 1. Each state transition matrix (STM), O of (5) can be written as the product 
of two n X n-matrices as 

$(t) = L(l)e'‘^ (6) 

where lL(t) is T-veriodic and C is a constant n x n matrix, hit) and C, in general, are 
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Remark 1. There exists a complex matrix C such that 


where M can be expressed as 

(7) 

■f 

M = 0“^(o)0(r). 

(8) 


Remark 2. fit, the eigenvalues of M are called the characteristic multipliers and the sta- 
bihty condition can be expressed as l/x; j < 1, i = 1,2,..., n. 


COROLLARY 1 

Each state transition matrix 0(f) can also be factored as 



0(0 = 0(06*^^ 

(9) 


where the matrix Q(f) is real and periodic with period 2T and R 
matrix. 

is an appropriate real 

V, 

COROLLARY 2 

The Liapunov-Floquet transformation 


f 

x(f) = L(f)z(f) 

(10) 


reduces the original time-varying system (5) to 



i(f) = Cz(f) 

(11) 


which is time-invariant. 

Moreover, the 2T-periodic transformation 



x(t) = Q(f)z(f) 

(12) 


produces a real representation given by 


--)■ 
f i 

z(t) = Rz(t) 

(13) 



3. Computation of L-F transformation matrix via Chebyshev polynomials 

It has been shown by Sinha and associates (Sinha & Juneja 1991; Sinha & Wu 1991; 
Joseph et al 1993; Sinha et al 1993; Wu & Sinha 1994) that the STMs of linear periodic 
systems can be obtained in terms of the shifted Chebyshev polynomials of the first kind. 
The technique is efficient and since the STM is expressed as an exphcit function of time 
f, it is suitable for algebraic manipulations as well. In fact, if the dimension is small, the 
STM can be expressed in a closed form as an exphcit function of system parameters as 
shown by Sinha & Juneja (1991) for the case of Mathieu equation. 

In order to compute the L-F transformation matrix, L(t), one needs to find the STM 
0(f) associated with the linear system given by (5). If A(t) in (5) is commutative, then 
L(t) can be computed as 

L«) = (14) 
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where Brit) can be obtained in a closed form as shown by Lukes (1982). For a general 
A(r), first, the Chebyshev polynomial expansion technique is used to compute the STM 
<E>(0- In this technique, the solution vector x(r) and the periodic matrix A(r) in equation (5) 
are expanded in terms of the shifted Chebyshev polynomials in the interval [0, T] as shown 
below. 

5 — 1 

Xi(t) ^ = s*^(Ob', i = 1, 2,..., n 

r=0 

■s —1 

A(0 44; (0 = s*'^(t)d‘j, i, j = 1,2,..., n 

r=0 

where b^. are unknown expansion coefficients of Xi (t), dr^ are known expansion coefficients 
of Aij (t) and s* (t) are the shifted Chebyshev polynomials of the first kind. For convenience 
in algebraic manipulation an n x nm Chebyshev polynomial matrix is defined as 

S(t) = I ® 8*^(0, (17) 

where < 8 ) represents the Kronecker product (Sinha & Wu 1991), and I is an n x n identity 
matrix. Using the definitions in (15), (16) and (17), x(t) and A(t) can be rewritten as 

x(r) = S(0b, A(f) = S(0D (18) 

A(0x(0 = S(0Qb (19) 

where b = {b^b^b^ • • • b"}^ is an nm x 1 vector, D = • • • d‘j], ij = 1, 2, 3, 

4,... n, is an nm x n matrix and Q is an nm x nm product operation matrix (for details 
see Sinha & Wu 1991). 

Substituting equations (18) and (19) in the integral form of equation (5), the unknown 
constant b can be determined by a set of linear algebraic equations. Therefore, the solution 
vector X can be determined from equation (15). However, the computation of ^(0 requires 
a set of solutions of (5) with n initial conditions: x, (0) = (1, 0, 0,..., 0), (0, 1, 0,..., 0), 
(0, 0, U 0,..., 0),..., (0, 0,..., 1). By defining the resulting set of Chebyshev coefficient 
vector b'.s of the n solutions in a matrix form, the STM can be written as (see Sinha & 
Juneja 1991; Sinha & Wu 1991; Joseph etal 1993; Sinha et al 1993; Wu & Sinha 1994), 

<&(0 = S(0B, (20) 

where B = [616263 • • • 6 „] and (0) = I. It has to be noted that the STM is valid only for 
0 < t < T since the shifted Chebyshev polynomials of the first kind are defined over the 
interval [0, T]. When r > T, the STM can be evaluated using Floquet theory (Yakubovitch 
& Starzinskii 1975) as 

= t = g + nT, ^e[0, T], n = l,2,...,€. 

( 21 ) 

Once O it) is known, the T -periodic complex matrix hit) or the 2r-periodic real matrix 
Q(0 can be computed in the following way (Pandiyan et al 1993; Sinha & Joseph 1994; 
Sinha & Pandiyan 1994; Pandiyan & Sinha 1995). Since <&(0) = I, L(0) = LT = I, the 


(15) 

( 16 ) 


where C is a m x n constant complex matrix. By performing an eigenanalysis on $(T), 
the matrix C can be computed easily. Then the T -periodic L-F transformation matrix is 

L(0 = (23) 

In order to evaluate the 27-periodic real L-F transformation matrix Q(T), first we note 
that (cf Yakubovitch & Starzinskii 1975), 

0(27) = ^^(T) = , (24) 

where C* is the conjugate matrix of C, the n x n constant real matrix R = [C + C*]/2 
and the 2-7 periodic L-F matrix can be represented as 

Q(f)=<t>(f)c-«'; 0 < f < T; 

Q(r + r)=4>(T)Q(rV-'*''; r < (T + r) < 2r; 0 < r < r. 

It should be noted that Q(0 = Q(r-t-27). 

If one is interested in finding or then there are two avenues. L(r) and 

Q(0 may possibly be inverted through the use of a symbolic software like MACSYMA/ 
MATHEMATICA/MAPLE. However, this is neither reahstic or even possible at this point 
in time. The other approach is to first find the STM 'L(r) of the adjoint system 

w = — A^(Ou;, (26) 

and use the following relationship (cf Yakubovitch & Starzinskii 1975), 

0-1(0 =-0^(0. (27) 

The computation of is critical in determining L-i(0 or Q~^(t). For example, 

L~i (r) can be evaluated utilizing the properties of the adjoint system as shown below, 

L-i(r) = = e^^^'^(t). (28) 


Such an approximation of L-F transformations has been found to be extremely convergent 
(Pandiyan et al 1993; Sinha & Joseph 1994; Sinha & Pandiyan 1994; Pandiyan & Sinha 
1995) and since it is periodic with period 7, the elements L,y(0 and Qij(t) have the 
truncated Fourier representations 


Uj(t) 


Qijit) 


q 

^ Cfl exp(i27rnr/7), / = \/T7i 


n=-q 


2 


Tvnt ^ lint 
-h an cos ^ sin — 


n=l 


n=l 


(29) 

(30) 


respectively. 

Since complex matrix L (t) (or the real matrix Q (t)) c an be computed as a function of f, all 
algebraic manipulations involving this matrix can be done in symbolic form using MATH- 
EMATICA or MACSYMA. L~i (r) and (t) have similar Fourier representations. 

In this study only the real L-F transformation Q(t) has been used to make it more 
appealing to the engineering community. A 12- to 15-term Chebyshev expansion has been 
found to yield extremely accurate representation of Q(t). The accuracy of Q(t) is directly 
dependent upon the convergence and accuracy of the STM 0(t) itself. A convergence 
study in the computation of 0(t) has been reported by Joseph et al (1993). 
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4. Time-dependent normal forms and centre manifold reduction 

4.1 No rma I forms 

The fact that A(0 is time dependent in (4), a direct application of normal form theory is 


not possible. Using the transformation 


x(0 = Q{t)zit), 

(31) 

(4) takes the form 


z = RzA- Q~\t){t 2 (z, t) + f 3 (z, ?)), 

(32) 


where R is an n x « constant matrix and nonlinear terms of order four and higher have 
been neglected, since for generic codimension 1 bifurcations, the fourth order terms do 
not affect the local stability behaviour (Arnold 1988). The form of (32) is amenable to 
direct application of the method of time dependent normal forms (TDNF) for equations 
with periodic coefficients as shown by Arnold (1988). 

Equation (32) in its Jordan canonical form can be written as 

y — 3yW 2 (y,t)W3(y,t) (33) 

where J is the Jordan form of matrix R and w^iy, t) are 27"-periodic functions and con¬ 
tain homogeneous monomials of yi of order 2 and 3. Using a sequence of near identity 
transformations of the form 

y = v + hr{v,t), (34) 

where hr (v, t) is a formal power series in v of degree r(r = 2, 3) with periodic coefficients 
having the principal period 2T, (33) can be reduced to its simplest form 

V = 3vW2(v,t)W2(v,t). (35) 

It is important to note that the ■W 2 (v, t) and w^iv, t) contain only a finite number of 

Fourier harmonics. This is due to the fact that the solution of the resulting homological 
equation depends on the resonance condition relating the eigenvalues of J and the Fourier 
frequencies of Wr(v, t) (Arnold 1988). It should be pointed out that the solution of the 
time-dependent homological equation requires the solution of a large set of linear algebraic 
equations even for a 2 x 2 system. For example, if for such a system, the L-F transformation 
matrix Q(t) is represented by a fifteen-term complex Fourier expansion and let us say that 
the degree of the monomials r = 3, then one needs to solve (2 x 124) equations in blocks 
of 31. 

4.2 Centre manifold reduction 

In situations where some of the eigenvalues of J in (33) are critical, the stability of (33) can 
be discussed in the centre manifold via time-periodic centre manifold theorems. Applica¬ 
tion of the normal form procedure to the reduced set of equations in the centre manifold 
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following, a theorem due to Malkin (1962) has been utilized to develop a practical method 
for finding the centre manifold relations for the time-periodic systems. 

Let us assume that (33) has n\ eigenvalues that are critical and m eigenvalues that have 
negative real parts. Therefore, (33) may be rewritten in the form 



where the subscripts c and 5 represent the critical and stable vectors, respectively. Accord¬ 
ing to the centre manifold theorem^ there exists a relation (Malkin 1962; Pandiyan & Sinha 
1995) 

= h(yc,t), 


such that /i(yc) 0 is of the form. 




(0>^1 


ym 


mi -I-h mnj > 1, 


(38) 


where (0 are periodic coefficients with period IT. The relation given by 

equation (32) can be obtained as the formal solutions of the equations (see Malkin 1962; 
Pandiyan & Sinha 1995) 


^ys . w \ 


dt 


Jc>^c+Wc 


(39) 


where Wc = Wd + Wd and Wj = Wsi + are nonlinear vector monomials of the 
critical and stable states of the system, respectively. It is important to note that the resulting 
solutions will be meaningful only if the coefficients Bs ^ ^ (r) are also periodic. Although 
there exists an infinite number of expansions similar to (38) which have finite coefficients 
and also satisfy (39), there is only one with periodic coefficients. This result was first 
reported by Malkin (1962). 

As a result of substitution of (38) in (39), a set of differential equations in terms of the 
unknown coefficients sf * (0 is obtained in a form 


dB 




dt 


AjtS^ — O5 


(40) 


where Xj, j = 1, 2,..., ^2 are the eigenvalues of the stable part of the system and 
m„^ known integral rational functions of the periodic coefficients on the 

right hand side of (40). The coefficients Bg^^ can be obtained by formally solving 

the above set of differential equations. For this purpose, Bg '^' ^ ^ is assumed in the form 

of a finite Fourier expansions as 


(mi •••»:„ j) 

"5 


/ . {2Tznt\ / ^,Jl HL \ 

(f) = flO + 2^ fln cos I — ^ i -b hn Sin ( 1 

n=l \ T / \ T J 


(2nnt\ 


(41) 


where f = 2T. Substituting (41) in (40) and equating like terms on both sides of the 

_ __j.* _ _ __.1. _i_1_• ^ ___• _ ^ ..1 . 1 _ . . A ^ _3 7^ 
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obtained. The constants an and bn can be computed by solving these algebraic equations and 

therefore the coefficients Bg can be determined in the form of (41). Substitution of 

(32) in (36) clearly decouples the stable and critical states and, hence, the problem reduces 
to the investigation of stability of an dimensional system in the centre manifold. The 
resulting system of n \ periodic equations is of the form 

yc=Jcyc + w* (42) 

where vector W* contains nonlinear monomials which are functions of jc only. 


5. Applications 

To demonstrate the applicability and effectiveness of the suggested approach, two ex¬ 
amples are considered. As a first example, a nonlinear Mathieu equation is considered 
for which the L-F transformation has been obtained using the computational algorithm 
discussed in § 3. Although this is a simple example, it brings out the key points clearly 
and shows the superiority of the proposed methods over the classical methods such as 
averaging, perturbation, etc. Whereas the results of the proposed technique, based on L-F 
transformation and normal forms, provide reasonably good solutions even for moderately 
large parameters multiplying the nonlinear terms, the traditional averaging procedure is 
applicable only when the parameters multiplying the periodic terms and the nonhnear 
terms are both small. 

In the second example, the dynamic behavior of a time-periodic double pendulum under¬ 
going various bifurcations is examined. In particular the critical dynamics under secondary 
Hopf and flip bifurcations are studied in detail. 


5.1 Example 1. Mathieu equation with cubic nonlinearity 


Consider the Mathieu equation with cubic nonlinearity in the form, 

X -f -f- (a -(- cos o)t)x -(- € x^ = 0, (43) 

where S, a, p,co and e are the parameters of the system. In the state space form the above 
equation is rewritten as 



0 

—(a 4- iS coscot) 



(44) 


where {xi,X 2 }^ = {x,x)^. Following the steps described in §3, the 2-T periodic real 
L-F transformation matrix Q(t) can be computed for a given parameter set. Applying the 
transformation x = Q(t)z, (44) is transformed to 



■^11 

Rii 


Rn 

Rii, 


zi 

Z2 




0 

-^iQ\\z\ 4- Qnzif 


(45) 


where Ri ,• are the elements of the real matrix R and Oi ; are the elements of the L-F trans- 


(45) can be written in the canonical form as 




0 


0 


^2j 1^2 




7 ]l(^ T:)yl + /i 27 

721 + /22(^’ '^)>’?y2 + /23(^’ '^)yi>'2 + '^)3'2 


( 46 ) 


where t = 27, A = {A-i, A. 2 } are the eigenvalues of R and the periodic coefficients 
fij (r, x)\i, j = 1,..., 4 are expressed as 


fij(t, T) = a'o^ + 

rt = l 


cos 





(47) 


After experimenting with various sets of system parameters, it was observed that / = 15 
provided quite accurate representations of functions fij (At). This has been reported earlier 
by Pandiyan et al (1993). It is also consistent with the number of Fourier terms taken in 
the representation of the L-F transformation Q(t). 

In order to obtain a solution of equation (46) using TDNF, consider a near-identity 
nonlinear transformation 

yi=u-\- suit, r)M^ + gi 2 it, t)u^v + gi 3 (r, t)uv^ -)- guit, r)v^ 

.... (48) 

y2 = v-^ 8l\{t, t)u^ -f g22it, t)uv^ + g23(A + glAit, t)v^. 


where the periodic coefficients (A t); /, 7 = 1,..., 4 are once again of the form given 
by (47) but with unknown constants and Substituting (48) in (46) and solving the 
resulting homological equation as described earlier, the unknown constants and bn can 
be evaluated. In situations when there is no resonance, the Fourier series assumed for 
gij{t, r) and its derivative are found to be convergent (Arnold 1988). On the other hand, 
if resonance takes place, the unknown constants of the corresponding periodic coefficient 
cannot be determined and some nonlinear terms remain even after the normal form reduc¬ 
tion. As long as there is no resonance, all nonlinear terms are eliminated and the reduced 
normal form is just the linear part of (46). Therefore, the solution of the nonlinear Mathieu 
equation in the original coordinates can be obtained by substituting back all the intermedi¬ 
ate transformations. Even when some of the nonlinear terms remain due to resonance, the 
resulting equation can still be used to provide many useful conclusions about the stability 
and dynamical behaviour of the system. Such procedures are described by Bruno (1989) 
and Hale & Kocak (1991). 

In order to obtain an approximate solution of (46) via the time independent normal form 
(TINF) theory, variations of the periodic coefficients of nonlinear terms are neglected in 
comparison with their predominant means. This approximation results in an equation of 
the form 


{^4 ~ 

'a., 0 ] 

, 

■ + ■ 

«37 +^477 +^5>'72 

17) 

—1 

0 
_ 1 

1 ^ 2 ] 


,*3^1 + + hyl 


^re y\ and y 2 are the approximate states and aj,bj, j = 3,..., 6 are the means of the 
iodic coefficients of (47). To apply the TINF theory, consider a nonlinear near-identity 
^sformation of the form 

yi = M -h p[U^ -f p2U^v -H p2uv'^ -I- p4V^, 

~ 9 9 

y9 = y + q\u^ -I- <72w y + + q4V^, 


ore Pi and qi are unknown constants. Substituting (50) in (49) and solving the resulting 
onomous homological equation, (49) can be reduced to a linear form in most of the 
Nations except when resonances due to nonlinearity occur. The approximate solutions 
t±iis case can also be obtained in a fashion similar to the procedure discussed above. It 
ilso observed that as long as the system has eigenvalues that are distinct with negative 
positive real parts-, one could completely reduce the system to a hnear form. 

A.t this point an application of the traditional averaging procedure to (43) is briefly 
cussed. Assume a generating solution of the form {cf Sanders &. Verhulst 1985), 

x(t) = zi{t) coscDQt smajQt, (51) 

CDQ 


lere cDq = a and Zi,i = 1,2 are the slowly varying coefficients of the solution. Using this 
Lution in (43) and averaging over the principal period In Jo), one gets a set of quasilineai* 
tonomous differential equations with cubic nonlinearity in the averaged coefficients z as 


Z2 


R 


11 


dO 

^21 


12 


22 



^3 21 + 1 ^2 + “^5 ^ 1 ^2 + ^6 ^21 

^3^1 +Ml^2+^5^U2+^6^2r 


(52) 


lere 5/ and bi,i = 3 ,..., 6 are the constants depending on the parameters of the system 
ci Rfj are the elements of the constant matrix R^. However, the eigenvalues of the matrix 
^ will be quite different from those appearing in (49). The solution of the above equation 
n be discussed via the TINF theory. 


2, Case studies 

the following, several case studies are presented by selecting various sets of parameters 
<5 and e in the Mathieu equation given by (43). The parameter o) is selected as In in 
L cases of this study. 


dse 1. Trajectories in stable manifold {a ^ 0): Parameter set 1: a — 10.0; ^ — 0.2, 
= 1.8974; 6 = 0.3; Note that the parameters ^ and 6 multiplying the periodic and 
e nonlinear terms respectively are selected small. The suggested approaches, including 
c traditional averaging, are applied to this set and the results, along with the numerical 
>lution, are presented in figure 1. It is understandable that all the methods predict the 
ihaviour of the system correctly due to the smallness of the parameters ^ as well as e. 
z>T brevity, the coefficients of near-identity transformations for the normal forms are not 
scorded in this paper. 

cirameter set 2: a = 0.5; ^ = 4.0; 5 = 0.4243; € = 0.3; For this case, the parameter 
lultiplying the periodic term, is selected to be 8 times larger than a. From figure 2, 
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x,(t) 

Figure 1. Comparison of solutions of Mathieu’s equation with cubic nonlinearity; a = 10, 
=0.2, 5 = 1.8974, e =0.3. 


it is observed that except for the averaging method, all other techniques predict similar 
trajectories which finally approach the fixed point (0, 0). 


Case 2. Equation without generating solution (a = 0): Note that when o; = 0 the 
fundamental frequency coq of the autonomous part of (43) is zero and without a generating 
solution the averaging approach cannot be applied. 

Parameter set: a = 0; ^ = 4.0; 5 = 0.4243; e = 3.0; This set shows that the behaviour 
of the system is well predicted by the suggested techniques when compared with the 
numerical solution even though the nonlinearity parameter e is 10 times larger than the 
value used in set 1. The comparison is shown in figure 3. 

The coefficients of the near-identity transformations for the TDNF as well as TINF 
methods are once again omitted for brevity. 

Case 3. The centre manifold case: The application of the suggested techniques to the 
special case when the resonance condition prevails due to the presence of a pair of purely 
imaginary roots is shown in this part. For this case, by applying normal forms, all the 
non-resonant terms are annihilated, however, the homological equation corresponding to 
the resonant terms cannot be resolved. Therefore, some of the nonlinear terms of third 
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Figure 2. Comparison of solutions of Mathieu’s equation with cubic nonlinearity; a = 0.5, 
= 4.0, (5 = 0.4243, € = 0.3. 


degree stay in the reduced equation and hence, in general, a closed form solution is not 
possible. Under such circumstances it may be possible to neglect the periodic variations 
of the nonlinear coefficients and still retain the stability behaviour of the system. In the 
following, a case study is provided to demonstrate the apphcations in similar situations. 
Parameter set: a = 4.0; f = 7.468; 5 = 0; e = 0.3; For this set of parameters, applica¬ 
tions of the L-F transformation, the Jordan canonical transformation and the near-identity 
transformation (c/ (48)) to (46) results in an equation of the form, 



where, /i 2 (f, t) and fi^it, r) are complexified periodic functions corresponding to the 
resonant terms and iitwo = ±j 0.816 are the eigenvalues of the system. Multiplying u by 
V and V by M and adding, a linear differential equation in (mv) can be obtained. Therefore, 
the analytical solution of (53) can be found in (uv) which is of the form 


uv = —1 



a(X, t)dx, 


(54) 


where a (f, r) is a complex Fourier function. The differential equation (53) can be decoupled 
by substituting the solution (54) into (53) and the resulting linear differential equation in 










x,(t) 

Figure 3. Comparison of solutions of Mathieu’s equation with cubic nonlinearity: a = 0, 
=4.0,3 = 0.18974, € =0.3. 


u can be shown to be of the form, 


u= \ -fcuo + e/i2(r, r) 




U, 


i= —fcuo + e ^ Cnexp(i2jrnf/T) u, 


where are complex constants of Fourier expansion of the periodic terms and hence 


w = exp —ia)Q-\-€ c„ exp(z27r?2f/r) t 


In a similar manner, v can also be computed. It should be noted that the stability of these 
solutions entirely depend on the real part of the constant coefficient of the time-varying 
function exp(i27rnt/r). The solution is stable and/or unstable depending on whether the 
constant has negative real part or positive real part, respectively. This observation has also 
been made by Rosenblat & Cohen (1980,1981) following an entirely different approach. 
When the real part is zero, the solutions are closed orbits and behave like limit cycles. A 
typical result is shown in figure 4. 
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Example 2. Double inverted pendulum with parametric excitation 

In this example, the bifurcation of a two-mass inverted pendulum subjected to non- 
k conservative periodic load is discussed. The nonlinear equations of motion of the system 
are of the form (Jin & Matsuzaki 1988; Pandiyan & Sinha 1995) 

0i=-O.5(J5i -t- 252)01 + ^202 + 0.5^(p - 3)01 
4- 0.5/:(2 - p)02 - 0.5(0^ -f- - 02) 

~(pA/12){(0i - K02)^ - (1 - y)30|} - ((01 - 02)V4) 
x{/:(p - 4)01 -f- ^(3 -I- p{y - 2))(j>2 - (^i + 352)01 + 35202}> 

02=O.5(5i 4- 452)01 - 25202 + 0.5(5 - p)^0i 

4- {(5(1.5 - y) - 2)/:}02 -H O.5(0i - 02)(30^ 4 01) 

. 4-(_5^/12){(0i-y02)3-3(l-y)303}-h((0i-02)V4) 

4 X{^(25 — 7)01 4^(5 4 5(y - 3))02 - (25i 4 552)01-b 55202), 

where m is the mass, I is the length of the links of the pendulum, 0i and 02 the displacement 
angles, 01 and 02 are the corresponding rates, y(0 < y < 1) is the load-direction parameter 
and P = Pi 4 P 2 coscvr. Other symbols appearing in (57) and (58) are defined as h\ & 
b 2 == damping parameters, 5i = b\/mP‘, B 2 = b 2 /mf-, p = Pl/mf-, k = k/ml^ 



k = stiffness parameter, Pi = magnitude of static load, Pi = amplitude of the dynamic 
periodic load. Equations (57) and (58) are rewritten in the state-space form as 


71' 
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0 “ 

72 
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0 
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73 


0.5k(p 

- 3) 0.5k{2 - p) 

-0.5(Pi +252) 
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.74. 


_0.5^(5 

- p) fc[p(1.5-y)-2] 

0.5(51+452) 

-252_ 
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^2 

73 

74 


0 

0 

[ - 0.5(y2 + y2)(3;^ _ y^) _ (pk)[(yi - yyi)^ 

-a - y)^72]/12 - 0.25(yi - y2)Hk(p - 4)7i 
-i-^(3 -I- p(y - 2))y2 - (Pi + 3^2)73 + 35274]] 
[0.5(yi - y2)(373 + 74 ) + P^[(7l - 772)^ 
-3(1 - y)3y3]/i2 + 0.25(yi - y2)^[^(2p - 7)yi 
-l-fe(5 -1- piy - 3))y2 - (25i + 552)^3 + 5 5274]] 


where {yi, y2i 73i 74 } = {<t>u <h.^ <Pl,<p2}- In the following, the dynamics of a primary 
single Hopf and a single flip bifurcations of the above 4-dimensional system is discussed 
via centre manifold principle by reducing the problem to a two and a single dimension 
respectively. 


(i) Hopf bifurcation - For the parameter set, k = 2.0, B\ — Bi— 0.016, Pi = 0.5, P 2 = 
0.966, y = 0.8, = 1.0, (46) yields a pair of complex Floquet multipliers with modulus 

one which corresponds to a single Hopf bifurcation. After normalizing the time with 
(OT = 27rt, the L-F transformation corresponding to (59) is computed. The application of 
this transformation to (59) leads to the following dynamically equivalent Jordan canonical 
form. 


71 

72 

73 

74 


■-0.17791 -b 1.14613 i 
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0 
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0 0 

-0.17791 - 1.14613 i 0 

0 0.3391 i 

0 0 


0 
0 
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-0.3391 ij 
(60) 
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i:^,it)y^^yTyTyT 

'LS{t)y^h2^yTyT 
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1=1 


i:^,(t)yTy2^yTyT _ 

where (r), (t), (r) and (r) are the complex vector periodic coefficients consisting 

of 31 elements (this corresponds to the number of Fourier terms taken in the expansion of 
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L-F transformation matrix) with period 2T corresponding to all possible monomials of 
order 2 m y\, y 2 , y^ &. y^. Note that two of the eigenvalues in (60) are purely imaginary 
which is to be expected in this case. The centre manifold relations for this problem are 
assumed in the form 


yi = ^2\(^)y3 + + ^ 23 (^) 3 ' 33'4 + ^ 24 (^) 3^4 






such that Bfj(t), i = 1,2 &, y = 1, 2, 3, 4 are unknown periodic coefficients with period 
IT. Note that in the above equation the states corresponding to stable eigenvalues are 
expressed in terms of the states corresponding to the critical eigenvalues. 

Substituting (61) in (60), eight ordinary differential equations in Bfj(t), i = 1,2 & 
j = 1, 2, 3, 4, similar to (40) are obtained. The periodic coefficients appearing on the right 
hand side of these differential equations are nothing but the known periodic coefficients 
corresponding to the cubic nonlinear terms appearing in (60). The unknown periodic 
coefficients Bfj{t), i = 1,2 Sc j = 1, 2, 3,4 can be obtained by formally solving these 

differential equations. In order to obtain a particular solution, Bfj (t) is assumed in the form 
of (41) with unknown constant coefficients and like terms on both sides of the equations 
are equated to obtain a set of linear algebraic equations in terms of the unknowns and 
bn - The computation of all the unknowns of the Bfjitfs requires the solution of a set of 
8x31 linear algebraic equations. These algebraic equations can be solved such that each 
of the Bfj(t), i = 1, 2 & j = 1, 2,3,4 can be obtained as Fourier series expansions. 
Noting that the problem under consideration consists of only cubic nonlinearities, it is not 
necessary to solve for all the periodic coefficients Bfj(t),i = 1, 2 & j = 1, 2, 3,4 in the 
centre manifold relation. Instead, it suffices to compute only one coefficient per relation 
in (61). This simplification does not affect the final outcome of the result, since the centre 
manifold relations result in nonlinearities which are of powers greater than three and does 
not influence the stability characteristics. Therefore, for this case, only coefficients B^^ (t) 
and B 21 (t) are computed. 

Substitution of centre manifold relations (61) in (60) results in differential equations for 
the critical states ys and y 4 which contain nonlinearities of cubic and higher orders. Since 
the higher order terms do not affect the stability characteristics, the terms of order higher 
than 3 are neglected. The equations thus obtained are similar to (46) representing a Hopf 
bifurcation behaviour in a two-dimensional system with cubic nonlinearity. Following the 
procedure outlined earlier in the study of Hopf bifurcation of a single degree of freedom 
system, the application of time-dependent normal forms to these equations provide a 
simplified nonlinear equation similar to (53). The behaviour of the fixed point of the 
resulting equation is found to be a centre by employing similar methods to those outlined 
by Pandiyan (1994). For brevity, the calculations are not reported here. On the basis of 
the arguments presented for example 1, the motion resulting from the Hopf bifurcation 
is quasi-periodic and bounded. It can readily be seen that the Poincare plots provided in 
figures 5a and 5b also confirm this result. 
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Figure 5. (a) Poincare plots - Double inverted pendulum under Hopf bifurcation. 


—0.999938—1) and the system undergoes a flip bifurcation. After the transformation, 
the following Jordan canonical form is obtained 


' jr 
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(62) 


where (r), (0, Cg (t ) and (t) once again are vector periodic coefficients with period 
2T . It is observed that the eigenvalue corresponding to the third state is zero and the 
remaining eigenvalues have negative real parts. The centre manifold relations for this case 
can be assumed in the form 


yi=B3i(0y|; y2 = B5,(t)y|; y4 = B|i(t)yi, (63) 

where Bj|(0, B^jCO and B 3 j(f) are unknown coefficients with period 2T. These can be 
determined by solving the differential equations in B ^ ^ (t), (t) and B^j (r) as described 
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(b) Poincare plots - Double inverted pendulum under Hopf bifurcation. 


earlier. The computed coefficients are recorded by Pandiyan (1994). Substituting (63) into 
equation (62) and neglecting the higher order terms beyond the cubics, the one-dimensional 
centre manifold equation is found and since the mean value of the periodic coefficient of 
the reduced centre manifold equation for this case is positive, the fixed point is unstable 
and hence the corresponding 2T periodic orbits in the original coordinates are unstable. In 
f a similar way, the fold bifurcation of this system can also be studied. However, the results 
are not included here. 
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Abstract. This paper highlights the important results of the investigations 
on Electrochemical Discharge Machining (ECDM) conducted by the author 
and his coresearchers. It has been found that “switching phenomenon” plays 
a crucial role in spark generation and not the straightforward breakdown of 
the non-conducting vapour blanket. The mechanism of spark generation has 
been understood reasonably well and inductance in the circuit has emerged 
as an important process parameter from the investigations. This information 
has been effectively used to improve the process capability of ECDM by a 
substantial amount. It has also been shown how ECD can be very conveniently 
used for micro-welding operation without using any sophisticated arrangement. 

Keywords. Electrochemical discharge machining; switching phenomenon; 
spark generation; inductance; micro-welding. 


Introduction 

nufacturing is now recognized by all sections of our society as one of the most important 
.vities since it affects very significantly the GNP, international trade and employment, 
nufacturing is broadly defined as 

“an activity by which material, labour (including information), energy and machines 
are brought together to produce a product whose value is more than the total value 
of the individual inputs”. 

s is schematically indicated in figure 1. However, this definition is too generalised and 
5 can be more specific stating that 


Value 



Figure 1. Value addition in manufactur¬ 
ing activity. 


“Engineering industry manufacturing” is what we mean by “manufacturing” in the fieM 
of Mechanical Engineering. Figure 2 suggests that almost one third of the production in 
India comes from such activity and, therefore, “mechanical manufacturing” represents a 
very vital pillar of our economy. 

“Machining” has a special status among all processes covering the whole spectrum of 
“mechanical manufacturing” because of its capabilities as below. 

• High accuracy and finish can be achieved. 

• Complex shapes can be produced. 

• Machining force and power are independent of the work size and shape as all machining 
processes are basically generating processes. 

• The property of the bulk of the work material remains unaffected. 

Machining has also been one of the earliest manufacturing processes and is continuously 
evolving to cope with the new challenges faced by the manufacturing industries. 


All Industries (100 V.) 




All Manufaclurlng Industries (77 %) 




Ail Engineering 
industries (31%) 


Figure 2. Proportion of engineering 
industries. 
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Figure 3. Family of machining processes. 


In machining, the required shape, size and finish are obtained by removing excess 
material in a controlled manner in the form of small chips. In conventional machining 
removal is in the form of small chips and in many advanced processes chip size is extremely 
small. Removal of excess material can be achieved in various ways using different forms 
of energy as shown by the tree diagram in figure 3. 

The two most extensively used unconventional machining processes are 

• Electrochemical machining (ECM), and 

• Electric discharge machining (EDM). 

Considering the different aspects of their capabilities, these two processes are considered 
to possess maximum potential. However, inspite of their many virtues both these processes 
suffer from a very important limitation. To employ either ECM or EDM the work material 
must be electrically conducting. The development of electrochemical discharge machining 
has taken place in recent years primarily to eliminate this difficulty. 


2. Electrochemical discharge machining 

A study of all machining processes indicates that any scheme for removing material in the 
form of small particles in a controlled fashion can be used for the shaping of objects. To 
circumvent material and shape problems, quite often the approach adopted for machining 
is to cause the melting of a small portion of the workpiece by means of intense localized 
heat generation. By controlling the location of the heat source in a proper way, the required 
shape of the workpiece can be achieved. 




Power supply 

4 - - 


Lorge electrode 



Figure 4. Schematic diagram of BCD set-up. 


One phenomenon used to produce intense localized heat generation without using any 
sophisticated technology like electron beam or laser beam is the electrochemical discharge. 
Figure 4 shows an electrochemical cell where one electrode is very small and the other 
is relatively much larger. When these electrodes are connected to a voltage source (either 
AC or DC) an electrical discharge can be seen at the tip of the smaller electrode if the 
supply voltage exceeds a critical value. This is known as electrochemical discharge which 
takes place between the tip of the smaller electrode and the electrolyte in its immediate 
neighbourhood. 

The temperature at the discharge zone has been measured by thermocouple technique 
(Basak 1991) and the result is shown in figure 5. This is, of course, the temperature of 
the thermocouple junction which itself happens to be the tip of the smaller electrode. The 
temperature of the discharge itself has been estimated by emission spectroscopy (Reghuraixi 
1994) for different electrolytes with varying concentration. The temperature is found to be 
in the range 8000-10 000 K. 

Thus, it is possible to cause material removal in very small quantities by melting and 
evacuation if the workpiece is kept very near the tool tip and within the range of the dis¬ 
charge. The work material need not be electrically conducting in this case. Such a scheme 
is presently in the process of development and this process is termed “Electrochemical 
Discharge Machining (ECDM).” 

The scheme of the basic ECDM process is indicated in figure 6. The tip of the tool 
electrode is very close to the work surface (electrically nonconducting) and small holes 
can be drilled by this process. Slots can be also cut by using a knife edge tool electrode. 
Slots or grooves of any shape can be machined by controlling the lateral motion of the tool 
tip of a pointed tool. 

3. Mechanism of ECD phenomenon and ECD machining 

This process, though it appears to have considerable potential for machining electrically 
non-conducting materials, has very limited acceptance mainly because of its limited ca¬ 
pacity. To improve the process capability of ECDM it is essential to understand the basic 
mechamsm of ECDM and identify the process parameters correctly. 

The process is extremely complex involving electrochemistry, heat transfer and boiling, 
melting and evacuation. To understand ECDM, first the phenomenon of ECD should be 
investigated. For this purpose an extensive study has been carried out. One of the important 
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(b) 

Figure 5. (a) Set-up to determine electrode temperature, (b) Result obtained. 


findings is the nature of the potential drop across the electrolytic cell. A typical variation is 
indicted in figure 7 (Allesu 1988). It is clear that an overwhelmingly large fraction of the 
total drop takes place across a very thin layer surrounding the tool electrode (the smaller 
one) during electrochemical discharge. This suggests that a resistant layer develops around 
the tool electrode. In most experiments this is the cathode and, therefore, H 2 evolves in 
the form of very fine bubbles. Moreover, the temperature of the electrolyte at this region 
increases because of much higher current density. This leads to boiling if the current density 
is high enough. Both these result in the development of a non-conducting gas and vapour 
blanket around the smaller electrode (this does not happen at the larger electrode because 
the current density is much less). When the voltage (either full-wave rectified DC or smooth 
DC) applied across the BCD cell is gradually increased, the corresponding change in the 
average current is shown in figure 8. An extensive study shows (Allesu 1988; Basak 1991; 
Allesu et al 1992; Reghuram 1994) that these characteristics remain unchanged for all 
situations. It is seen that electrochemical discharge starts only when the applied voltage 
reaches a critical value which depends on the type of electrolyte and its concentration. The 



depth to which the electrode is submerged and the electrode diameter do not appear to 
have much effect as long as the discharging electrode is much smaller than the other one. 

It is also very interesting to note that the power consumption (for a given electrolyte and 
set of electrodes) at the critical condition (= Vcr-hr) remains constant and is independent 
of electrolyte concentration (Basak 1991). This strongly suggests that blanketing can be 
a major factor in the occurrence of electrochemical discharge as both H 2 evolution and 
vapour production through boiling depend on the power only. 

The conditions under which blanketing takes place have been investigated and a theo¬ 
retical model has been developed for the BCD mechanism and for ECDM (Basak 1991; 
Basak & Ghosh 1995) which can predict the critical voltage, critical average current and 
also the material removal rate under prescribed conditions. The results agree reasonably ^ 
well with the experimental observations. This investigation also provides a new input 
which suggests that the electric discharge is primarily due to (Basak 1991; Basak & Ghosh 
1995) switching action and not to the breakdown of gas in the blanket as suggested by the 
previous researchers. 

Figure 9 shows the various stages of bubble coverage of the tool surface as the apphed ' 
voltage is gradually increased. At the critical condition, the hemispherical bubbles cover the \ 

surface completely. In this closed-packed condition, the in-between conducting bridges are I 

very narrow and the current densities in these bridges become very high and cause instant 
boiling which blows the bridges. When a bridge is blown a spark is generated by switching 
action. This is why sparking takes place even if the applied voltage is much lower than ^ 

the minimum voltage for breakdown as indicated in the Paschen curve. Figure 10 shows | 

Ae equivalent circuits. As switching action plays the crucial role in spark generation, I 
inductance in the circuit becomes one of the most important parameters. This however 
remained unnoticed so far. This finding thus leads to an extra process parameter which can 
be easily controlled and a substantial improvement in the process capability of ECDM can 
be achieved. 






Figure 7. Distribution of voltage drop in an ECDM bath [Electrolyte: NaOH (35%); tool 
polarity; -ve; power supply: DC]. 



Voltage (v ),RMS 


Toot dia; 0-11 cm Tool depth ;=0 2 cm 
Power supply : Smooth D.C. 


Figure 8. V-I characteristics for differ¬ 
ent electrolytes. 
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Figure 9. Bubble density on tool elec¬ 
trode at different applied voltages. 
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Figure 10. (a) Discharge locations with bubble distribution at critical condition, (b) Ide¬ 
alised switching off situation, and (c) Idealised equivalent circuit at discharge. 
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Tool dio = 0.16 cm Observed 

Machining time = I min -Colquloled 

Power supply * Rectified (full wove) 

D.C, 100 Hi.. 45 V 

Figure 11. Total inductance vs MRR (NaOH) (log scale, workpiece - glass). 

4. Improvement of process capability 

A number of experiments on ECDM have been conducted using artificially introduced 
inductance in the circuit (Basak 1991; Basak & Ghosh 1995b). It has been observed that 
increasing the inductance in the circuit results in higher material removal rate. The increase 
is quite significant and a set of sample results is shown in figure 11. Further investigation 
can provide more interesting results in this direction. 

The idea of using artificially introduced inductance in an ECDM circuit has been put 
forward (Reghuram 1994) in a somewhat different direction. It has been found that when 
smooth DC voltage is applied, the introduction of a series inductance of adequate value in 
the circuit can lead to a “total discharge condition.” In this condition the discharge appears 
like a flame emerging from the tool tip. To obtain a stable total discharge without the 
undesirable disturbance in the electrolyte, the input voltage is increased to a high value 
100 V) to obtain stable discharge. Once the stable discharge is attained the input voltage 
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is lowered to the operational value 60 V) and it is found that the energy input to the 
system is much higher. 

Under this situation it is not necessary to keep the workpiece in contact with the tool 
tip as is done in the common ECDM. It is enough to keep the work surface within the 
range of the stable discharge; the process can be considered to be somewhat analogous to 
laser-beam machining. However, gravity feed is no longer possible in this type of ECDM 
and the tool has to be fed by a servo drive as is done in EDM. Figure 12 shows that by 
employing this modified ECDM holes can be drilled in a quartz block with a penetration 
rate of 0.3 mm/min and can be continued even if the tip reaches depth of the order of 3 mm 
or more. This has not been possible with the usual ECDM without a series inductance. 

5. Other manufacturing applications 

At this point it is worth mentioning a few other applications of ECD in manufacturing 
processes. Using this phenomenon a scheme for scribing on non-conducting materials has 
been developed (Allesu 1988). A scribing pen to write on glass and ceramics was fabricated 
and it performed satisfactorily. 

Another important manufacturing application of ECD phenomenon besides machining is 
microwelding. It has been shown (Allesu 1988) that micro-welding of a fine thermocouple 
can be very conveniently performed using the heat generated by ECD. The scheme is 
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Switch 


E lectrolyte 




Figure 13. Schematics of micro-welding set-up. 


very simple and is shown in figure 13. The process has been investigated in detail (Parija 
1993; Ghosh et al 1995) and it was possible to determine the optimal combinations of the 
process parameters for obtaining excellent welds. Figure 14 shows the suitable regions of 
the voltage-current combination which yield good results. The work is still in progress. 

Since a controlled localized heating is possible with the help of electrochemical dis¬ 
charge other important processing methods can be developed. Surface treatment using the 
pointed heat source is one such example. A more interesting application of the ECD phe¬ 
nomenon can be “Rapid Prototyping” with fused deposition of very thin metallic wires as 
indicated schematically in figure 15. Currently rapid prototyping using metals instead of 
polymers, resins, plastics etc. are done using high power lasers and the system is expensive. 
A low budget metallic fused deposition modelling type rapid prototyping may be possible 
using ECD. 

6. Concluding remarks 

The present investigations indicate that ECD can be a very useful tool for different types 
of manufacturing processes. The mechanism of spark generation is found to be primar¬ 
ily the switching phenomenon rather than the breakdown of insultaing gas layer. This 
understanding resulted in a major improvement of the process capability by artificially 
introducing a suitable amount of inductance in the circuit. Further research resulted in a 









Figure 14. Nature of voltage - electrolyte concentration in microwelding. 


Carriage driven by 



Servodrive for the y- direction not shown 
fignre 15. Scheme of ECD fused deposition modelling. 
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D phenomenon has been very effectively used for micro-welding. ECD micro-welding 
very simple process and can be useful to the industry. Similarly there is a great poten- 
for ECD in other types of material processes and fused deposition modelling for rapid 
totyping. 
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Abstract. The current focus of manufacturing research on fibre-reinforced 
plastics (FRP) is composed of the search for efficient processing techniques 
capable of providing high quality machined surfaces. Very limited work has 
been performed to identify the influence of manufacturing processes like edge¬ 
trimming and drilling on material performance. Recent reports suggest that 
process-induced damage may affect the mechanical behaviour of FRP materi¬ 
als. Therefore an experimental study of orthogonal cutting was conducted on 
the edge trimming of unidirectional and multi-directional graphite/epoxy com¬ 
posites with polycrystalline diamond tools. The effects of tool geometry and 
operating conditions were evaluated from an analysis of chip formation, cutting 
force, and machined surface topography. All aspects of material removal were 
found to be primarily dependent on fibre orientation. Discontinuous chip for¬ 
mation was noted throughout this study, regardless of machining parameters. 
Three distinct mechanisms in the edge trimming of fibre-reinforced composite 
material including a combination of cutting, shearing, and fracture along the 
fibre/matrix interface were observed. An investigation conducted on the com¬ 
pression, flexural and impact strength of graphite/epoxy composites machined 
by both traditional and non-traditional techniques, confirms that manufactur¬ 
ing characteristics may not only affect bulk properties but also influence the 
initiation and propagation of failure. 

Keywords. Fibre-reinforced plastics; manufacturing processes; process- 
induced damage; graphite/epoxy composites. 


Introduction 

2 -reinforced plastics (FRP) are a contemporary class of multi-constituent materials 
:h exhibit superior strength and stiffness-to-weight ratios in comparison to homoge- 
is counterparts. Consequently, polymeric composites are quickly gaining prominence 
e aerospace, automotive, and other high technology industries, where both weight and 

449 


450 


M Ramulu 


performance govern design constraints. The ability to develop complicated part shapes 
and align fibres to accommodate for critical load orientations has facilitated the design and 
production of high-performance aircraft using composite materials in excess of 50 percent 
(Kinkaid 1988). Construction of commercial and military aircraft composed entirely of 
composite materials is imminent (Riggs 1984; McCarty 1993). However, specific details 
concerning post-mold manufacturing technology utilized for FRPs inhibit their use in new 
designs and in the development of new products. Contrary to net-shape manufacturing 
of homogeneous materials, FRPs are traditionally molded to near net shape with compli¬ 
cated patterns incorporating specific geometry and laminae orientation, then cured with a 
combination of temperature and pressure. Secondary processing such as trimmingdo final 
shape and drilling for fasteners is typically required to facilitate component assembly. 

Surface roughness and tolerance are closely related, and it is generally necessary to 
specify a smooth finish to maintain a fine tolerance in the finishing process. For many 
practical design applications, tolerance and strength requirements impose a limit on the 
maximum allowable roughness. The reliability of machined components, especially for 
fibre-reinforced plastics (FRP) in high strength applications, is often critically dependent 
upon the quality of surface produced by machining, and the surface layer may drastically 
affect the strength and chemical resistance of the material. However, inhomogeneity of 
FRPs, caused by the difference in the properties of the fibre and the matrix materials, will 
have a machined surface that is less regular and is usually rougher in comparison with 
machined metal surfaces (Koplev etal 1983; Kdnig etal 1985; Abrate & Walton 1992a). 

Traditional methods of machining often induce critical flaws in the component paits 
during net trimming, and various degrees of delamination, splintering, fibre pullout, and 
cracking have been reported (Koplev et al 1983; Konig et al 1985; Ho-Cheng & Dharan 
1990; Abrate & Walton 1992a; Colligan & Ramulu 1992; Wang et al 1992). Due to 
the limitations of traditional processes, many non-traditional manufacturing methods have 
been examined for net shape machining of composites (Koplev et a/1983; Konig et a/1985; 
Ho-Cheng & Dharan 1990; Abrate & Walton 1992a; Colligan & Ramulu 1992; Wang et al 
1992). The abrasive waterjet (AWJ) is one such method which appears to be highly suited 
for production trimming of FRP materials (Ramulu & Arola 1993, 1994). Owing to its 
ability to achieve a quality surface at rapid production rates, AWJ is currently being sought 
for production applications (Mortimer 1987; Miller 1991). However, very little is known 
regarding the effects of AWJ machining on the surface integrity of FRPs, or the influence 
of post-mold manufacturing defects on mechanical performance in general. Considering 
the present and future applications of FRP materials to commercial aerostructures and 
other related industries, it is necessary to obtain a fundamental understanding of post¬ 
mold manufacturing process effects on the structural integrity of fibre-reinforced materials. 
Investigating by machining-incurred damage to mechanical performance and service life 
is imperative to the rehability of composite structures. 

The choice of net trimming or drilling technique is often based on a combination of 
process economics and visual aesthetics of the machined surface. Although important, 
these factors are not appropriate for the choice of a manufacturing method considering the 
probable influence of secondary operations on the performance of composite materials. 
Surface roughness characteristics and surface integrity of machined FRP composites in 
general, and graphite/epoxy in particular, have not been studied in detail. Therefore, recent 



s concerning machining-induced edge effects on the structural performance of FRPs 
/iewed and results from an investigation on machining mechanisms and machining 
id surface integrity of graphite/epoxy will be presented. 


ackground 

Machined quality of fibre-reinforced plastics 

factured surfaces are evaluated using a number of different methods, each of which 
lique quantitative features corresponding to the machined quality. The choice of 
;tion technique may depend on a variety of criteria including available equipment, 
s^estigator, and other terms in which quality control guidelines are established. Me- 
)al performance of homogeneous material component parts has been shown to be 
:tion of the residual stress induced in the material and the surface topography, 
reinforced plastics do not develop residual stress by applied forces. Therefore, the 
'f of machined FRP parts is often interpreted by surface profilometry and visual 
ques. 


Surface morphology: Many methods have been used in evaluating the quality of 
rimmed surfaces such as the use of average roughness parameters, c-scans, and other 
techniques. Some of these methods and associated quality criteria may be insuffi- 
for distinguishing features that influence part performance (Philips & Parker 1987; 
old et al 1989; Jamil & Chambers 1991; Wem 1991; Ramulu et al 1993). For in- 
i, visual inspections are often used to identify the degree of damage to a component 
tid also to discuss prevalent topographical features. However, visual documentation 
les little information that can be used in a quantitative analysis or non-subjective 
arison. Ramulu and Wem (Wem 1991; Ramulu et al 1993) studied the suitability of 
ird roughness parameters in describing the topography of unidirectional and multi- 
ional graphite/epoxy (Gr/Ep) trimmed with polycrystalline diamond (PCD) tools, 
js statistical and random process methods were used in addition to standard rough- 
>arameters to analyse the trimmed surface characteristics. In this study it was found 
ithmetic average surface roughness (Ra), which is often used as a parameter for qual- 
ntrol, failed to describe the relative extent of surface variation observed from visual 
us. More appropriate parameters for describing topographical features of the inho- 
neous material surface include the peak-to-valley height (Ry) and ten-point height 
Cumulative height distribution (CHD) on a probit scale and power spectrum density 
) function were also found very useful in characterizing the spatial distribution of the 
rimmed graphite/epoxy material. However, one drawback of a profile based descrip- 
) that matrix smearing resulting from traditional edge trimming obliterates details of 
named surface (Wang 1993). This implies that even in rigorous quality inspection of 
raphy, some features remain unnoticed that may influence the stmctural integrity of 
iponent part. Therefore, it is often desirable to supplement quantitative descriptions 
led by profilometry with a visual analysis of the surface morphology. 
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2.1b Microstructural integrity: Post-processing microstructural integrity refers to the 
relationship between fibre and matrix after machining. Research addressing microstructural 
integrity resulting from post-mold processing of FRP’s is limited. Preliminary work has 
shown that AWJ machining of Gr/Ep does not affect the interfacial relationship between 
constituents whereas studies on traditional trimming and laser cutting of FRP’s suggests f 
that the fibre/matrix interface may be altered (Tagliafemctfl/1985; Wem 1991;Ramulu& 
Arola 1993). Microstructure of the machined surface and accumulation of process-induced : 
damage could prove crucial to the structural integrity of FRP’s, especially when subjected to 
cyclic loading. In an assessment of post-mold processing influence of composite materials, 
microstracture should be examined and correlated with mechanical strength. 

2.2 Machining-induced edge ejfects on structural integrity 

The design of a FRP component is typically based on choice of constituents, distribution 
and orientation of the laminae, and the geometry chosen to accommodate for service loads. 

In addition to these parameters, the effects of manufacturing and associated defects must 
be considered. 


2.2a Drilling effects: A handful of studies have been reported on the influence of drilled 
hole quality to the stmctural integrity of FRP materials (Wood 1978; Pengra & Wood 1980; 
You & Chou 1988; Ghasemi Nejjad & Chou 1990; Tagliaferri etal 1990; Lin & Lee 1992; 
Mehta et al 1992). Some alternatives to conventional drilling like molding holes prior to 
the curing process have been suggested to improve part strength of components requiring 
fastener holes. For instance, in a comparison with molded holes, it was recently shown 
that laminates with drilled holes exhibit lower tensile, compressive, and bearing strengths 
(You & Chou 1988; Ghasemi Nejjad & Chou 1990; Lin & Lee 1992), with a reduction 
ranging between 20 and 70%. The superior strength of panels with molded holes has been 
attributed to fibre continuity, an increase in fibre volume fraction near the hole vicinity, and 
the absence of matrix microcracks induced during drilling (Ghasemi Nejjad & Chou 1990). 

There are remarkable discrepancies between results from different investigations re¬ 
ported on the influence of drilling-induced hole quality to static and fatigue strength of 
FRP panels (Wood 1978; Pengra & Wood 1980; Tagliaferri et al 1990; Mehta et al 1992). 
Wood (1978) concluded that hole quality does in fact influence the fatigue strength of com¬ 
posite panels but does not noticeably influence the tensile strength. Pengra & Wood (1980) 
similarly concluded from experimental work that restrictions on chipout defects (pitting) 
and oversize holes could not be relaxed (from definitions in 1980) but suggested that quality 
control requirements for exit delamination in graphite/epoxy may be too stringent. On the 
other hand, Mehta etal{\ 992) found that hole quality in PMR-15/Gr laminate, particularly 
including exit ply delamination, may influence both tensile and compressive properties of 
panels with drilled holes. Tagliaferri et a/ (1990) concluded that although hole quality does 
not influence the tensile properties of composites, it has significant influence on bearing 
strengfli. Although results from these studies are inconclusive, they unquestionably suggest 
that drilling-induced fastener hole quality influences the mechanical behaviour of FRP ma¬ 
terials. Since drilled-hole anomalies are consistent with those generated during traditional 
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ge-trimming of FRP’s, it should be rationalized that net-trimming operations and man- 
icturing induced defects may influence the structural integrity of FRP component parts. 

ib Edge finishing effects: In addition to the choice of constituents, one of most im- 
rtant parameters affecting the mechanical properties of a FRP laminate material is com- 
nent lay-up. Pipes and Pagano (1970) studied the influence of stacking sequence on 
5 development of interlaminar shear and interlaminar normal stresses in FRP laminates 
bjected to external loading. These phenomena are commonly referred to as “free edge 
bets”. Edge effects coupled with the presence of manufacturing defects on the trimmed 
ge may prove critical to component performance. Large magnitude interlaminar stresses 
the free edge necessary to compensate for force and moment equilibrium between plies 
creases the need for maintaining high edge quality during net-shape trimming. Edge trim- 
ng damage of FRP component panels could be critical to the stress transfer mechanisms 
the free edge. 

Howarth and Strong (1990) investigated the influence of machining induced edge effects 
abrasive waterjet (AWJ) and laser cutting on the strength of FRP materials. Tensile spec- 
ens of various widths were machined with both laser and AWJ methods from fiberglass, 
ivlar, and graphite fibre epoxy matrix composites. In tests conducted with Kevlar fibre 
mposites, tensile strength and modulus decreased with decreasing specimen width as- 
2 iated with the influence of manufacturing induced edge effects. However, from testing 
fiberglass composites, specimens machined with the laser decreased with decreasing 
icimen width but abrasive water jets specimen tensile strength increased with decreas- 
I specimen width. Graphite fibre composites machined with the AWJ showed the same 
nd, increasing tensile strength with decreasing specimen width. Laser machined Gr/Ep 
ecimens were not tested due the poor cut quality resulting from thermal properties of the 
iphite fibres. No explanation was provided to support the unusual experimental results 
d no subsequent studies have been reported. Sadat (1988), on the other hand, examined 
i influence of cutting parameters on process damage and resulting interlaminar tensile 
ength of unidirectional Gr/Ep composites machined with a 60 tooth high speed steel 
SS) slot saw. From an optical study of the machined surface, damage features in the 
rm of cracks, delamination, and fibre rotation were noted within the machined region, 
le extent of machining damage decreased with increasing cutting speed corresponding 
the decrease in cutting forces monitored in both the feed and normal directions. Fur- 
irmore, minimizing machining damage with high cutting speeds resulted in an increase 
the interlaminar tensile strength. 

Based on this brief review of machining effects on the surface integrity of FRP materials, 
appears that the choice of processing techniques as well as the process conditions used 
machining may be significant. However, only a hmited degree of work has been done in 
s area, and current knowledge of machining influence on surface integrity is insufficient, 
iditional investigations are necessary to enhance the understanding of manufacturing 
bets on FRP material performance. Therefore, an investigation on the surface integrity 
graphite/epoxy was conducted at the University of Washington was started about ten 
ars ago to study the influence of edge-trimming with poly crystalline diamond tools, 
rniond abrasive cutters, and abrasive waterjets on the compression and flexural strength 
graphite/epoxy composites. 
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3. Orthogonal machining 

3.1 Materials and procedures 

A unidirectional 4mm thick graphite/epoxy (Gr/Ep) panel with 3501-6 resin and IM-6 
fibres was used in this investigation. Aim square unidirectional panel was fabricated 
into test specimens with the desired fibre orientations for orthogonal trimming. Fibre 
orientations are defined clockwise with reference to the cutting direction as shown in 
figure 1. Unidirectional fibre orientations greater than 90° are typically regarded as negative 
orientations by convention. A multidirectional Gr/Ep laminate panel with 6 p.m fibre diam¬ 
eter, 0.68 volume fraction, and 200 |xm ply thickness was used for the edge-trimming ex¬ 
periments. Panel layup of [45°/-45°/(0°/90°/45°/-45°)2]5 eliminated stretching/bending 
coupling and resulted in less chance of delamination and debonding. PCD tool inserts in 
the trimming experiments were medium grade General Electric COMPAX 1300 with 7 |xm 
grain size. Various tool insert geometries were studied including 0°, 5°, and 10° rake angles 
(a) with 7°, and 17° clearance angles (y). Mechanical properties of the PCD material and 
unidirectional Gr/Ep are listed in table 1. 

A Rockford planer-shaper equipped with a hydraulic table to provide steady cutting 
motion was used for the orthogonal edge-trimming experiments. On-line dynamic cutting 


Table 1. Mechanical properties of PCD and Gr/Ep. 


Property 


Material 

Graphite/Epoxy 



0° 


90° 

Tensile strength (MPa) 

1378 


41 

Tensile modulus (GPa) 

117.0-138.0 


8.0-11.0 

Compressive strength (MPa) 

1309 


N/A 

Compressive modulus (GPa) 

107.0-124.0 


N/A 

PCD 

Knoop hardness (HK) 


3400-3700 


Compressive strength (GPa) 


700 


Modulus of elasticity (GPa) 


827-1103 


TRS (MPa) 


482-1723 


Thermal conductivity (W/m°C) 


50-92 
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)le 2. Edge trimming test matrix. 

Lphite/epoxy fibre orientation 0°, 15°, 30°, 45°, 60°, 75°, 90° 

D insert a 0° 0° 5° 5° 10° 10° 

Dmetry y 7° 17° 7° 17° 7° 17° 

ring speed 4, 9, and 14m/min 

Dth of cut 0.127, 0.254, and 0.381 mm 


iracteristics were recorded with a CCD Hi-8 format camera. Cutting forces were mea- 
•ed using a 3-dimensional circular-type strain gauge dynamometer attached to the tool 
5t. 

Machining tests were conducted following a full factorial design of experiments, in- 
Tporating the conditions listed in table 2. All machining was carried out under dry 
ling conditions conforming to industrial specifications. Methods of analysis included 
th in-situ and post-process measurements of machining characteristics. In-situ analysis 
hniques consisted of optical recording of chip formation and cutting force measure- 
;nts along the principal and thrust directions via the 3-axis dynamometer. All aspects 
chip formation from inception through chip release were recorded. Surface topography 
i machined-surface characteristics were examined using profilometry and SEM. Sur- 
:e roughness measurements were obtained by a Federal Surfanalyzer 4000 with a 5 |xm 
imeter diamond stylus probe, 0.8 mm cutoff length, 1.25 jxm measurement interval, and 
1.5 mm sampling length. Microstructure of the machined surface was evaluated with 
eol JSM-T330A scanning electron microscope to supplement surface roughness mea- 
'ements and provide a qualitative view of machining process damage. Details of this 
:t of study can be found in Wang et al (1995). 

* Results and discussion 

this study it was found that chip formation was critically dependent on the fibre orien- 
ion (Wang et al 1995). The change in chip formation with fibre orientation was clearly 
able by examination of the macrochip. Discontinuous chip formation was observed both 
unidirectional and multidirectional laminate machining. 


la Cutting mechanisms: Cutting mechanisms present in machining of FRP materi- 
i in this study are shown schematically in figure 2. For the 0° material, chip formation 
jchanisms were composed of mode I loading and fracture along the fibre/matrix inter¬ 
ne, mode n loading through tool advancement, and fracture perpendicular to the fibre 
•ection under bending loads. In positive fibre orientations between 15° and 75°, the chip 
mation mechanisms included fracture from compression-induced shear across the fibre 
is combined with interfacial shearing along the fibre direction during chip advancement, 
lip flow in trimming all positive orientations up to and including 90° material occurred 
a plane parallel to the fibre orientation. Material removal in nearly all positive fibre 
ientations, therefore, appears to be governed by the in-plane shear properties of the uni- 
'ectional material. Chip formation for fibre orientations greater than 75° is primarily 
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Figure 2. Cutting mechanisms in the orthogonal machining of Gr/Ep. (a) Delamination; 
(b) fibre buckling; (c) & (d) fibre cutting; (e) deformation; (f) shearing. 


compression-induced fracture perpendicular to the fibres and interlaminar shear fracture 
along the fibre/matrix interface. These mechanisms are consistent with those previously 
documented by others (Koplev et al 1983; Sakuma & Seto 1983; Kaneeda 1991; Krishna- 
murthy et al 1992). Edge trimming of large fibre orientations is frequently accompanied 
by out-of-plane displacement and material fracture ahead of the cutting tool. Based on 
the results for multi-directional laminates, orthogonal cutting mechanisms for all fibre 
orientations were distinguished. The cutting mechanisms of 0° and 45° plies in edge trim¬ 
ming of multi-directional laminates were identical to those of equivalent unidirectional 
fibre orientations. Conventional trimming of 0° plies with the PCD tools occurred by a 
combination of mode I and mode 11 loading and subsequent fracture. Mechanisms of chip 
formation and material removal for 45° plies originate from shear loading. Fracture oc¬ 
curred perpendicular to the fibre axis within the compressive field of the contact zone, and 
fibre/matrix interface shear permitted chip flow. However, chip formation mechanisms 
in trimming 90° and —45° unidirectional material were slightly different from that for 
multi-directional laminates as a result of the influence of adjacent plies. When machin¬ 
ing unidirectional composites with fibre orientations of 90° or more, extensive degrees 
of out-of-plane damage occurred. In contrast, support provided from adjacent plies in 
the multi-directional ply structure allowed for machining with relatively small degrees of 
structural degradation. Trimming damage associated with 90° plies of the multi-directional 
laminate was limited to a loss of fibre/matrix cohesion evident at the trimmed surface. 
Damage to the —45° plies was more extensive through shear failure along the fibre/matrix 
interface and fracture perpendicular to the fibre direction beneath the trimming plane. Tool 
geometry appeared to have no discernible effect on the degree of trimming damage. In-situ 
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similar to features of chip formation evident in trimming unidirectional material with fibre 
orientations greater than 90° (negative orientations). In general, chip lengths on trimming 
the laminate were larger than those obtained in trimming unidirectional material. Post¬ 
process characteristics of chip formation observed in edge-trimming the multi-directional 
laminate were also quite similar to those of unidirectional Gr/Ep, particularly in the 0° 
and 45° plies. Fragmented chips attributed to split failure ahead of the cutting tool were 
produced in the trimming of 0° plies. Very tiny chips were noted from 45° plies with 
chip release along the fibre direction. The fracture plane for both 0° and 45° fibre orien¬ 
tations was parallel to the fibre direction as in the case of 90° plies. However, contrary to 
trimming unidirectional Gr/Ep, tiny chips were produced in 90° laminate plies without 
any indication of out-of-plane displacement. Material damage was essentially eliminated 
due to support provided by adjacent plies in the multi-directional lay-up. Similarly, chips 
produced from trimming —45° plies of the laminate resulted in severe damage below the 
trimming plane, but the extent of damage was limited due to support provided by adjacent 
plies. 


3.2b Cutting forces: In support of chip formation dependence on fibre orientation, prin¬ 
cipal cutting and thrust forces in this study were primarily influenced by fibre orientation; 
operating conditions and tool geometry had very little influence. Principal cutting forces 
ranged from 100N for 0° to slightly more than SOON for 90° material. Specific cutting 
energy ranged from 1 x 10^ to 7.8 x 10^ J/m^, significantly lower than that documented 
from orthogonal trimming of conventional homogeneous materials (Koplev et al 1983). 
Contrary to the mechanics of chip formation in metal cutting, the thrust force was nearly 
always greater than the corresponding principal force except when trimming material with 
fibre angles of 0°, and greater than or equal to 90°, as shown in figure 3. In general, thrust 
forces commonly increased with fibre orientation up to 45°, and then decreased to 90°. 
High thrust forces may be attributed to the elastic recovery of the fibres within the contact 
zone prior to fracture. The elastic energy of the deformed fibres, released when the fibres 
are severed, imparts a thrust force on the tool flank and is a potent source for tool wear. This 
analogy is supported by the fact that the thrust force in trimming 0° and 90° orientations 
is almost the same as they undergo the least bending-induced elastic deformation. These 
results are consistent with results reported by Koplev (1980), Sakuma & Seto (1983), and 
Inoue & Ido (1986) but not with those of Kaneeda (1991). 

Features of the cutting force profiles in trimming multi-directional laminate were nearly 
identical to those recorded in edge trimming 90° unidirectional material. The force pro¬ 
files exhibited a high frequency fluctuation in cutting force (Fc). However, tool geometry 
affected the magnitude of both Fc and its fluctuation in trimming of the multi-directional 
laminate. Compounded effects of tool rake and clearance angles on the principal and thrust 
cutting forces are shown in figures 4a, b; a depth of cut (t) of 0.13 mm and 9m/min cutting 
speed (V) were used for the simulations. Optimal rake angle (a) in terms of minimizing 
the principal cutting force was determined to be approximately 7° as shown in figure 4a. 
Thrust force, however, increased with increasing rake angle but decreased with increasing 
clearance angle. The optimal tool geometry for minimizing the resultant cutting force over 
the range of ooerating conditions in this studv would consist of a tool with 6° ~ l°a and 



•15 IS 45 75 105 135 165 195 .15 15 45 75 105 135 165 195 

fibre orientation (degrees) orientation (degrees) 


Figure 3. Cutting forces in edge-trimmin'g'with a 10°Q!/17°y PCD tool. Cutting conditions; 
4m/min cutting speed, 0.25 mm depth of cut. (a) Principal cutting force; (b) thrust force; 
(c) resultant cutting force; (d) resultant force orientation. 


17°y angles. Increasing the clearance angle beyond 17° would result in lower thrust and 
hence resultant forces, but would ultimately result in a reduction of tool life. The optimal 
tool geometry to minimize cutting forces in orthogonal trimming of Gr/Ep in this study is 
in excellent agreement with that reported by Park (1991), which was generated from the ^ 
aspect of minimizing tool wear. 

Over the range of cutting speeds available from the planer/shaper machine {V < 
20m/min), essentially no change in cutting force was noted. However, principal cut¬ 
ting force increased linearly with depth of cut and in turn influenced the magnitude of the 
resultant cutting force and its orientation with respect to the plane of trimming. Although 
principal cutting force decreased with increasing rake angle, smaller resultant cutting forces ' 

were obtained with a 0° rake angle tool rather than positive rake angle PCD inserts due to 
the influence of thmst forces. In addition, smaller cutting forces were obtained for a 17° 
clearance PCD insert rather than a 7° clearance angle. 

3.2c Surface roughness: Surface quality in orthogonal trimming of FRP materials is 
closely related to chip formation and, hence, fibre orientation. Due to the unique chip 
formation mechamsms of the 0° fibre orientation and bare fibres on the machined surface, 

Ra and Ry values in the longitudinal direction were usually higher than those in positive 
fibre orientations. The average surface roughness of fibre orientations from 15° to 60° 
in both longitudinal and transverse measurement directions exhibited high quality with 







(a) 


220 


Fc (N) 
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Figure 4. Effect of tool geometry on the cutting forces. Cutting conditions: 4 m/min cutting 
speed, 0.13 mm depth of cut. (a) Principal cutting force; (b) thrust force. 


ghness between 1 and 1.5 |xm. Very few changes were noted in roughness measurements 
jveen fibre orientations of 15° and 60°. Surface texture measurements were not obtained 
positive orientations greater than 60° or negative orientations due to severe cutting 
lage as previously discussed. Peak-to-valley height (Ry) values in figure 5 exhibited 
ids similar to longitudinal and traverse direction Ra measurements though with a more 
sitive response to fibre orientation. 

lurface roughness measurements for laminate composites were significantly influenced 
the variation in ply angles and measurement direction. Matrix smearing was dominant 
45° and 90° fibre pUes, and bare fibres were usually observed on the 0° ply. The domi- 
t feature of -45° plies was the “in-depth” damage due to fibre pullout and intralaminar 
:king (delamination) which induced deep valleys in profile measurements taken trans- 
se to the trimming direction. Longitudinal measurements taken parallel to the cutting 
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Figure 5. Tool-geometry influence on surface texture. Cutting conditions: 9 m/min cutting 
speed, 0.25 mm depth of cut. Average roughness with Ty (a); and 17°x (b) average peak- 
to-valley height with Ty (c) and 17°y (d). 


direction did not indicate the extent of trimming damage, even when acquired on the 
—45° plies as shown in figure 6. Therefore, average surface roughness (Ra) and maximum 
height roughness {Ry) alone do not appear capable of indicating the “tme” surface quality 
of a trimmed FRP without additional methods of observation. A combination of standard 
roughness parameters and a statistical representation of profiles taken perpendicular to the 
feed direction as shown in figure 6 may be more appropriate for a qualitative inspection. 


3.3 Conclusions 

Orthogonal cutting mechanisms in the edge-trimming of graphite/epoxy laminates with 
polycrystalMne diamond tools were studied. Chip formation, cutting force, and surface 
morphology were evaluated with respect to tool geometry, process conditions, and ply 
distribution in the laminate. As with the trimming of unidirectional panels, discontinuous 
chips were observed with primary dependence on fibre orientation. Cutting mechanisms 
for 0° and 45° plies were identical to those in the trimming of unidirectional material. How¬ 
ever, chip-formation mechanisms in trimming 90° and -45° plies of the multi-directional 
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Figure 6. Transverse profile of the Gr/Ep 
lanunate. Cutting conditions: 4m/min 
cutting speed, 0.25 mm depth of cut, 
0°a/17°y tool, (a) Profile height; (b) prob¬ 
ability density of profile height; (c) cumu¬ 
lative height distribution. 






4.1 Materials and procedures 


Graphite/epoxy laminate material with 3501-6 resin and IM-6 fibres was used for all 
experiments in this preliminary investigation. Compression specimens were composed 
of plies of unidirectional tape (0° plies), plain-weave fabric (0°/90°), and bias-weave 
fabric (45°/—45°). Flexure specimens consisted of unidirectional tape laminated with 
combinations of 0°, 90°, 45°, and —45° plies. Panels used for both phases of testing were 
hand laminated with lay-up indicative of that used in aerospace structures. 

Compression specimens were acquired from a bulk laminate panel using either dia¬ 
mond abrasive cutters or AWJ machining according to a standard aerospace coupon design 
(Colligan 1993). All specimens were first rough-machined with a #30 grit diamond abrasive 
cutter. A test section of the dog-bone specimen was obtained using either diamond abrasive 
cutters or the abrasive waterjet incorporating various parametric conditions which provided 
a range in edge quality received from each machining technique. Flexure and impact spec¬ 
imens were obtained using three manufacturing techniques, namely traditional trimming 
with polycrystalline diamond (PCD) cutting tools, AWJ machining, and diamond-saw 
machining with a #220 garnet diamond-impregnated slot saw. Cutting conditions used 
in obtaining the compression and flexure specimens are available in Colligan (1993) and 
Arola & Ramulu (1994) respectively. Surface profiles of the machined specimens were ob¬ 
tained using a stylus profilometer according to ANSI B46.1-1985. Surface topography of 
the machined specimens was analysed from profile data using standard roughness param¬ 
eters, statistical methods, and random process methods in addition to visual techniques. 
Details concerning analysis procedures are provided in the aforementioned references. 

An MTS Universal Test Machine was used for compression tests with a stroke rate of 
0.8 mm/min, hydraulic grips, and buckling constraint. An extensometer was used to record 
strain data. Flexure and impact specimens were loaded to failure according to ASTM 
standard D790M (ASTM 1986). Four-point and three-point flexure configurations were 
used with 16-to-l span-to-depth ratio. Machined edges of the flexure and impact specimens 
were parallel to the plane of apphed load. Testing performed in both modes of loading 
was performed at ambient temperature and humidity. Ultimate load and associated failure 
phenomena were correlated with quahtative data of the machined specimens to study 
the influence of manufacturing and surface quality on mechanical performance under 
compression and bending loads. 

4.2 Results and discussion 

Study of machining influence on the performance of Gr/Ep subjected to compression and 
pure bending under four-point loading was conducted (Colligan 1993; Arola & Ramulu 
1994). Prior to conducting strength analysis, topographies of the machined surfaces ob¬ 
tained with each method of machining were analysed using profilometry. Typical profiles 
obtained from the machined surface of diamond-saw, PCD-trimmed, and AWJ-machined 
specimens as well as the profile probability density, and cumulative height distribution are 
shown in figures 7a-c respectively. Profiles in figure 7a were obtained perpendicular to 
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Figure 8. Damage to 45° ply of PCD edge-trimmed Gr/Ep laminate. 


the major axis (feed direction). In general, all three techniques produced relatively high 
quality machined surfaces. However, although the average surface roughness (Ra) of the 
PCD specimens ranged between 1 and 2 pm, extensive fibre pullout occurred with the 
—45° plies as evident in figure 8. Plies in this orientation exhibit fibre direction with slope 
opposite to the cutting direction and commonly fail along the fibre/matrix interface during 
traditional edge-trimming. Compression specimens obtained with the AWJ and diamond 
abrasive cutters were machined at process conditions that provided a wide range in surface 
quality. Average surface roughness of specimens fabricated with each technique ranged 
from 5 to 25 pm Ra. 

Following the qualitative inspection of the machined surfaces, compression, flexural and 
impact testing was conducted. While recording the stress strain response of compression 
specimens, it was noted that the AWJ-machined specimen loading response appeared to be 
a function of surface quahty. Typical stress strain curves for three AWJ machined specimens 
of differing quahty are shown in figure 9. Initiation and propagation of failure occurred 
first in the specimens with high surface roughness and exit ply delamination. Typical 
load displacement histories for PCD-trimmed, diamond-saw and AWJ-machined flexure 
specimens are shown in figure 10. Load deflection curves obtained during application of 
the bending load consistently suggested that permanent damage, i.e. failure initiation and 
propagation, occurred first in PCD-trimmed specimens. 
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Figure 9. Stress/strain history of AWJ-machined Gr/Ep compression specimens. 


Jubsequent analysis showed that surface roughness resulting from trimming with dia- 
nd abrasive cutters had essentially no influence on the ultimate compression strength, 
wever, although surface roughness of AWJ specimens had notable but limited influence 
the ultimate compression strength within comparable roughness of diamond abrasive 
cimens, exit ply delamination had significant influence. The relationship between sur- 
s roughness and ultimate compression strength from both processing techniques are 
wn in figures lla-b. The correlation between exit ply delamination resulting from 
trimming and ultimate compression strength is also shown in figure 11c. 

Jimilar to compression specimen analysis, characteristics of the flexure specimen ma¬ 
ned surface indicative of each fabrication method were quite different, but no distinction 
iltimate strength could be made from average surface roughness. Mean and characteris- 
strengths calculated for the three specimen batches from Weibull statistics were nearly 
livalent. 

fypical preliminary impact test results (Arola & Ramulu 1995) are shown in figure 12. A 
ical load and the load-line displacement profile for a diamond-saw specimen is shown 
figure 12a. The features of the impact event of interest include the peak load, peak 
;ture load, and the total energy to failure. Representative load and load-line displace- 
nt profiles for PCD and AWJ specimens are shown in figure 12b-d. Unfortunately, the 
lamic sensors available to record specimen displacement in this study had a travel limit 
5 mm. Therefore, the energy to complete failure, represented by the integral of the total 
remental load displacement curve, was not available. Hence, the energy calculated to 
limit of the displacement sensors is essentially the total energy to fracture. Averaged 

llf'C fnr imnor’t r\rr»r\i=»r<-i<=io r\-f 1 HA in Onlv minimal Hif- 
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Figure 10. Flexure specimen geometry and loading configuration. 
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Figure 11. Influence of processing on the ultimate compression strength, (a) Machined 
with diamond abrasive eiittersr Ch') AWJ-machined; (c) AWJ comnression Strength vs. exit 









Impact Load (N) S Impact Load (N) 


M Ramulu 




Figure 12. (a and b. Caption on facing page.) 
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Figure 12. Typical recorded load, load-line displacement, (a) diamond-saw, (b) PCD, 
(c) AWJ (a) (no. 50 garnet). (d) AWJ (c) (no. 150 garnet). 
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Table 3. Impact loading failure parameters. 


Machining type 

Peak load (N) 

Peak fracture load (N) 

Energy to fracture (N-m) 

Diamond-saw 

1285.8 ±60.8 

1057.9 ±207.9 

3.43 ±0.42 

Planer (PCD) 

1235.1 ±29.8 

1028.0 ± 155.1 

3.51 ±0.15 

AWJ (a) 

1215.0 ±67.6 

817.7 ±72.0 

3.04 ±0.15 

AWJ (b) 

1258.1 ±44.6 

904.5 ± 107.5 

3.24±0.14 

AWJ (c) 

1262.6 ±48.1 

958.4 ± 206.9 

3.29 ±0.26 


in peak fracture load and the energy dissipated varied with the method of machining. As 
expected, the diamond-saw had the highest peak load to failure and energy to fracture. 

Scanning electron microscope analysis of failed specimens suggested that features re¬ 
sulting from traditional trimming with PCD tools may have given rise to unique failure 
propagation. Extensive macro-cracking along the —45° plies occurred in PCD-trimmed 
specimens extending through the entire specimen length. Conversely, diamond-saw and 
AWJ-machined specimens exhibited much higher post-failure integrity with far fewer 
macro-cracks. Results from compression, flexure and impact tests suggest that surface 
finish alone does not degrade a laminate. Delaminations incurred during trimming control 
residual strength. 

4.3 Conclusion 

Machining-induced surface features discussed in this study had limited influence on bulk 
strength of the Gr/Ep material but may affect the impact or fatigue strength where flaws 
in material structure become more important. Preliminary results of impact tests indeed 
revealed the effect of edge-finishing on the peakloads to failure. In these loading config¬ 
urations, fibre pullout and fibre matrix debonding, as well as the high values of surface 
roughness, cause flaw propagation and accelerated failure. Further study of the effects of 
machining-induced damage on composite material performance will address this issue. 
Studies by Howarth & Strong (1990) combined with results from this preliminary investi¬ 
gation confirm that aspects of net-shape manufacturing, including the particular technique 
used and the resulting cut characteristics, may influence the structural integrity of FRP’s. 
However, the mechanisms responsible for reduction in mechanical performance are not 
clearly understood. 


5. Summary 

Investigation of machining mechanisms and machining induced edge effects on the struc¬ 
tural integrity of FRP materials was conducted. Consistent with previous reports, results 
from compression, flexural and impact testing to failure suggest that particular features 
of the machined surface may not only affect the bulk strength, but also the initiation and 
propagation of failure. Therefore, for predicting the future applications of FRP materials, 
based on the current state of applicable net-shape manufacturing technology, additional 
investigations are necessary to thoroughly understmd the effects of the methods of man¬ 
ufacturing on the structural integrity of FRP materials. 
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Abstract. Ceramic materials are finished primarily by abrasive machining 
processes such as grinding, lapping, and polishing. In grinding, the abrasives 
typically are bonded in a grinding wheel and brought into contact with the 
ceramic surface at relatively high sliding speeds. In lapping and polishing, the 
ceramic is pressed against a polishing block with the abrasives suspended in 
between them in the form of a slurry. The material removal process here re¬ 
sembles three-body wear. In all these processes, the mechanical action of the 
abrasive can be thought of as the repeated application of relatively sharp slid¬ 
ing indenters to the ceramic surface. Under these conditions, a small number 
of mechanisms dominate the material removal process. These are brittle frac¬ 
ture due to crack systems oriented both parallel (lateral) and perpendicular 
(radial/median) to the free surface, ductile cutting with the formation of thin 
ribbon-like chips, and chemically assisted wear in the presence of a reactant 
that is enhanced by the mechanical action (tribochemical reaction). The rel¬ 
ative role of each of these mechanisms in a particular finishing process can 
be related to the load applied to an abrasive particle, the sliding speed of the 
particle, and the presence of a chemical reactant. These wear mechanisms also 
cause damage to the near ceramic surface in the form of microcracking, residual 
stress, plastic deformation, and surface roughness which together determine the 
strength and performance of the finished component. A complete understand¬ 
ing of the wear mechanisms leading to material removal would allow for the 
design of efficient machining processes for producing ceramic surfaces of high 
quality. 

Keywords. Ceramic surfaces; abrasive machining processes; surface finish¬ 
ing; wear mechanisms. 


1. Introduction 

The thrust towards improved efficiency in gas turbines and internal combustion engines, 
the improved performance of wear resistant components, and the unique electrical and 
magnetic property requirements in electronic devices and sensors have focused attention 



ceramics over other materials include high hardness and strength at elevated temperatures, 
chemical stability, low friction, and high wear resistance. However, those properties that 
give ceramics superior wear resistance also make them difficult to machine. In addition, 
the limited ductility of these ceramics makes forming methods that rely on extensive 
plastic deformation useful only for ceramics in the green state. Thus, extensive machining 
is required for manufacture of complex shapes with high quality surfaces. Machining 
costs can constitute up to 80% of the cost of the manufactured component. A detailed 
understanding of the wear mechanisms underlying the machining of ceramics and the 
damage that they leave behind should allow for more economical manufacture of reliable 
advanced ceramic components. 

Recent authors have reviewed several aspects of wear of ceramics including Braza 
et al (1989) who overviewed its relationship to contact fatigue, Larsen-Basse (1994) who 
compared and contrasted wear of ceramics with much that is already known about cemented 
carbides and cermets, and Jahanmir & Dong (1994) who give examples of maps of wear 
regimes for relevant contact pressure and temperature. The following reviews mechanical 
aspects of wear associated with machining. 


2. Material removal processes 

While turning and milling ai'e used extensively in machining of metals, they are not ef¬ 
ficient for fully densified ceramics due to rapid tool wear and large amounts of surface 
damage. Diamond turning can be used for machining ceramics in the green state but finish 
machining is still required on the densified ceramic. Thus, surface finishing of ceramics is 
primarily accomplished by abrasive finishing processes such as diamond grinding, lapping, 
and polishing. These processes are required to meet the stringent tolerance and surface 
finish requirements in structural and electronic ceramics (e.g. < 0.05 p.m rms in magnetic 
recording heads, silicon wafers, face seals, and bearings). The abrasive processes remove 
material mechanically and introduce damage on the surface of ceramics (Marshall et al 
1983). This damage is usually in the form of residual stresses and cracks which have a 
major influence on the mechanical properties and integrity of the machined ceramic sur¬ 
face. Moreover, plastic deformation and subsequent residual stresses induced by surface 
finishing alter electro-magnetic properties such as permeability, resistivity, and refractive 
index of the surface in electronic ceramics (Stokes 1972) causing deterioration of their 
performance in electronic devices. 

In general, the material removal rates (MRR) in grinding are higher than the MRR in 
lapping and polishing. The higher MRR in the initial rough grinding operation leads to 
surface damage in the form of microcracks which may extend as deep as 100 p.m into 
the surface. Rough grinding is followed in turn by fine grinding, lapping and polishing in 
which this damage is removed to varying degrees. Furthermore, the lapping and polishing 
operations may also leave damage behind on the surface in the form of residual stresses 
and severely plastically deformed layers. 
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Grinding of ceramics is accomplished primarily with grinding wheels containing dia¬ 
mond abrasive grits (Subramanian 1988). The diamonds are fixed to the wheel through 
relatively compliant resin bonds or stiff vitrified bonds. The grits are statistically distributed 
over a range of sizes with an average size of ~ 100 |xm in wheels designated as coarse and 
~ 5 jjLin in wheels designated as fine. Typical process parameters used in grinding include 
wheel surface speeds of 25 to 50 m/s, depths of cut of 0.5 to 30 jxm, and table traversal 
speeds of ~ 20 mm/s. These parameters lead to an MRR per unit wheel width of the order 
of 0.1 to 1 mm^/mm/s and normal grinding forces per unit wheel width of 5 to 100 N/mm. 
For a given depth of cut, the grinding force typically increases as the abrasive particle size 
in the wheel decreases. Here the depth of cut refers to the depth of material removed in a 
single traversal of the wheel across the ceramic surface. However, the depth removed by 
a single particle is much smaller than this and varies over the length of contact between 
wheel and workpiece. The length of contact is / — y/lRd (neglecting wheel deflection) 
where R is the radius of the wheel and d is the depth of cut; typical contact lengths are in 
the range of 1 to 2 mm. 

The force applied to the grinding wheel produces wheel and machine deflections so that 
the actual length of contact varies from that calculated using geometry (Hucker et al 1994). 
The wheel deflection is due to the localized contacts between the stiff diamond abrasive 
particles (embedded in a compliant bond) and the workpiece as well as a global deflection 
due to the stress distribution associated with transmission of the total contact force through 
the wheel. A definitive calculation of the number of active abrasive particles, i.e. the number 
of particles actually engaged in the cutting action, and the distribution of forces on these 
particles is as yet unavailable. However, the statistical distribution of particles on the wheel 
surface suggests that many of the particles have depths of cuts much less than but of the 
order of the grinding depth of cut. Furthermore, the contact pressure between a single 
particle and the ceramic surface is very high and approximately equal to the ceramic 
hardness. This is to be contrasted with the average pressure across the wheel-ceramic 
interface estimated from force measurements, which is well within the elastic range. 

The large relative sliding velocity produces high temperatures at the abrasive-workpiece 
interface. For typical grinding conditions, infrared radiometric measurements of the peak 
temperatures have shown them to be as high as 1300°C (Hebbar et al 1992). These values 
are consistent with measurement and analysis of temperatures in single point grinding 
under similar grinding conditions. Further substantiation of high grinding temperatures is 
provided by the presence of spherical particles in grinding swaif while grinding hardened 
steels (Lu et al 1992). The grinding temperatures are localized near the surface and the 
resulting thermal gradients generate thermal stresses which are also important in under¬ 
standing the grinding process. 

The localized nature (i.e. local to the single abrasive particle-ceramic contact) of the con¬ 
tact stresses and temperatures during grinding, and a variety of observations pertaining to 
deformation and stresses on machined surfaces, strongly suggest that to understand the ma¬ 
terial removal mechanism during ceramic grinding, it would be useful to analyse the sliding 
indentation of a ceramic surface by a hard particle under depths of cut and sliding velocities 
occurring in grinding. Indeed, this view underlies much of the following discussion. 

Lapping and polishing of ceramics are carried out by placing a slurry of abrasive particles 
in a liquid vehicle between the specimen and a hard block (lapping) or a soft pad (polishing). 



476 


S Chandrasekar and T N Farris 


The block is loaded against the workpiece by hydraulic or mechanically applied pressure 
and rotated at slow speed. The particles roll and slide across the ceramic surface so that the 
wear resembles three-body wear as opposed to the two-body wear in grinding. Polishing 
is usually performed after initial grinding has been used to generate the workpiece shape; 
its primary purpose being the generation of smooth surfaces. A hard lapping block is used 
when stringent tolerances on the flatness of the workpiece are required, while polishing gen¬ 
erates a much smoother surface since many of the abrasive particles are embedded in the soft 
lapping block. Diamond particles are used extensively but softer abrasives such as AI 2 O 3 , 
SiC, and cerium oxide are also widely used. As in grinding, the particles are statistically 
distributed over a range of sizes with the average particle size ranging from 0.05-70 pum. 

The mean sliding velocity between the abrasive and the ceramic in lapping and polishing 
is of the order of 0.5 m/s or less which is two orders of magnitude less than that in grinding. 
In contrast to grinding, the sliding induced temperatures are thought to be insignificant in 
lapping and polishing. The forces on individual abrasive particles vary, with the total force 
applied to the block or pad being a process variable. In general, the surface roughness (Ra) 
of the finished surface increases with increasing lapping pressure. Typical removal rates 
in lapping and polishing range from 0.001 - 1 mm^/s which is less than that observed in 
grinding for a typical grinding wheel of 5 mm in width. The smaller MRR suggests that 
less damage is left behind on the surface from lapping and polishing than from grinding. 
As in grinding, the material removal can be viewed as due to sliding indentation for 
the particles that slide, and due to quasi-static indentation for particles that roll. This 
idealization is consistent with microscopic observations of lapped and polished surfaces, 
and wear particles formed by these processes (Chauhan et al 1993). 

Recently, a model for the distribution of abrasive particle forces during lapping based on 
statistical size distribution of the particles has been developed (Chauhan et al 1993). In that 
paper, the required compliance of the particle workpiece contact is calculated by assuming 
that it is the same as that for an indentation with a conical indenter. An interesting result 
from the calculation is that approximately only 1 out of 10 ^ particles is actively engaged 
in material removal at a given time. Further, by assuming that the surface roughness of 
the finished surface is related to the depth of the plastic zone produced by a particle, the 
surface roughness could be predicted from the properties of the abrasive particles and the 
workpiece, and the lapping pressure. The mean surface roughness, Ra , is found to be related 
to the average particle force while the peak-to-valley surface roughness, Rt , is related to the 
maximum force applied by a particle. The predictions of this model are, for most cases, in 
excellent agreement with experimental observations from lapping and polishing of AI 2 O 3 , 
soda-lime glass, and Ni-Zn ferrite using SiC abrasive slurries of different particle sizes. 
A similar calculation has not been carried out for grinding in part because of difficulties 
associated with measurement of the surface profile of the grinding wheel and lack of 
characterization of the statistical distribution of particle sizes on the wheel surface. 


3. Material removal mechanisms 

A glimpse into the wear processes that cause material removal during the abrasive ma¬ 
chining of ceramics is provided by optical and electron microscopy of machined ceramic 
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Figure 1. SEM micrograph of ribbon-shaped polishing chips from soda-lime glass. 

surfaces and machining chips. Such microscopic observations have shown that material 
removal occurs as a consequence of one or more of the following: lateral cracks breaking 
open onto the surface; gross fracture due to grain pull-out; median/radial cracks inter¬ 
secting with each other; and plastic micro-cutting through the fonnation of chips as in 
single-point turning of metals. These observations have also been confirmed through elec¬ 
tron microscopic studies of chips and wear particles formed by machining. The dominance 
of a given mechanism is closely related to the loads applied by individual abrasives on the 
ceramic surface during machining. Typically, when the externally applied loads transmitted 
by the abrasives are small, plastic micro-cutting or indentation (with upward displacement 
of material around the indent) mechanisms are found to dominate. This is particularly so 
when polishing at low loads and/or with a flexible, soft polishing cloth (polishing block) or 
in the so-called “ductile regime” mode of grinding. The surfaces produced by these modes 
of material removal/displacement are characterized by their extreme degree of smooth¬ 
ness. Also, the plastic micro-cutting action of material removal leads to the formation of 
thin ribbon-like chips even when machining brittle solids such as glass, ferrites, or MgO. 
Figure 1 shows such a chip formed during the polishing of soda-lime glass. Machined 
surfaces of ceramics such as Ni-Zn ferrite, soda-lime glass, Si 3 N 4 , and zirconia created by 
this plastic mechanism of removal do contain residual stresses of the order of 20-50 MPa 
(Chandrasekar etal\99\). Furthermore, dislocation etch-pitting experiments on ground 
and polished blocks of single crystal MgO show the presence of an intensely deformed 
surface layer with high dislocation densities. Both these observations reinforce the hypoth¬ 
esis of a plastic micro-cutting mechanism of material removal as well as the formation of 
microindentations and plastic scratches on machined ceramic surfaces. The plastic mate¬ 
rial removal mechanism in nominally brittle solids is no doubt a consequence of the large 
hydrostatic stresses generated in a small volume underneath the ceramic surface by the 
abrasive particles. 

When lamer loads are transmitted tbrnnnb the abrasives, a transition is Seen tO OCCUr in 





Figure 2. Schematic of sliding indenta¬ 
tion and resulting fracture. 


brittle fracture. The most common modes of material removal by brittle fracture are due to 
lateral cracks breaking open onto the surface, grain pull-out, or a crushing-type of material 
removal. Again, these conclusions have been reached from microscopic observations of 
machined surfaces and wecir particles. 

In order to understand better these mechanisms, and the driving forces behind them, it 
is worthwhile to analyse a reduced model for the machining of ceramics. This is provided 
by the sliding indentation process which entails a single abrasive particle sliding across 
the ceramic work-piece. This configuration has been examined extensively in the literature 
(Broese van Groenou et al 1979; Swain 1979; Evans and Marshall 1981; Cheng and Finnie 
1990; Ahn et al 1993). A schematic of this experiment and the resulting crack pattern are 
shown in figure 2. As summarized by Ahn et al (1993), the experimental results delineate 
different force regimes in which the various crack patterns illustrated in figure 2 occur. For 
a Vickers indenter sliding on soda-lime glass, the results can be summarized as follows. 
At normal loads less than ~ 0.05 N, no cracking is observed but a groove is formed 
indicating locahzed plastic deformation. The formation of such grooves in some cases 
is a consequence of material removal by a plastic-cutting mechanism, which produces 
chips similar to those shown in figure 1. In other cases, the groove is merely a sliding 
indent, i.e. the material within the groove is displaced mostly to the sides of the groove. In 
this latter instance there is no material removal but just a plastic indentation. In the force 
range of 0.05-0.8 N, a median crack is observed perpendicular to the surface. The depth 
of the median crack increases with normal force. In the range 0.8—3 N median cracking is 
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accompanied by lateral cracking parallel to the surface. At loads in the higher portion of 
this range, lateral cracks break through to the surface causing material removal. At still 
higher loads, say 3-6 N there is considerable crushing of particles in the scratch groove, 
along with a median crack. 

A stress analysis that approximates the localized inelastic deformation using a sliding 
blister field has been completed by Ahn et al (1993). This was obtained by extending 
the blister field model for static indentation proposed by Yoffe (1982). In this model the 
strength of the blister field is evaluated as a function of the indentation load by measuring 
the volume of the scratch groove (figure 2). The value of the tensile stress in the uncracked 
solid at the location of the different systems is reported by Ahn et al (1993). The stress 
that would initiate a lateral crack is found to be small for low loads and equals the median 
crack driving stress near loads at which lateral cracks are typically observed to occur. That 
is, the sliding blister model accurately predicts the experimentally observed critical load 
for lateral crack formation in soda-lime glass. To date, a complete stress analysis of tlie 
sliding indentation of brittle materials does not exist. 

It must be observed here that the sliding indentation forces in the results summarized 
above are in the range of forces applied to abrasives during lapping and polishing. The 
lapping force calculations described earlier yield average particle forces in the range 0.03- 
0.6 N for the lapping and polishing of glasses, ferrites, and AI 2 O 3 under typical conditions. 
The smaller forces correspond to abrasive particle sizes of ~ 1 |jim while the higher forces 
correspond to particle sizes of ~ 63 pm. The smaller of the force values are in the range 
where plastic material removal by cutting has been observed both during polishing and 
sliding indentation. The higher particle forces are well in the range of conditions for 
which lateral cracking is a dominant mechanism of material removal in sliding indentation 
experiments. 

The calculation of volumetric wear as a function of process variables in abrasive machin¬ 
ing processes has not been carried out to any significant extent. Evans & Marshall (1981) 
used fracture mechanics to obtain a formula for wear volume as a function of applied force 
and ceramic material properties, based on a mechanism of lateral cracking to describe 
material removal. However, their formulae for predicting the onset of lateral cracking as 
well as volumetric removal rates have not been validated by experiment, e.g. Larsen-Basse 
(1994). The analytical estimation of such wear rates during the grinding, lapping, and pol¬ 
ishing of ceramics, based on the different wear mechanisms that are operative, is a problem 
worthy of detailed study. 


4. Discussion 

This short review of recent work on abrasive machining of ceramics has highlighted the 
two main mechanisms of weai’ associated with material removal: ( 1 ) brittle fracture with 
lateral cracks intersecting the machined surface, when the load applied by an abrasive 
particle is high, and ( 2 ) ductile micro-cutting with chip formation taking place as in single 
point machining of metals. The evidence for these mechanisms has come from microscopic 
observations of machined surfaces, chips and wear particles; from a consideration of the 
forces and pressures applied by the individual abrasive particles in machining, and from 
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the nature of deformation occurring in ceramic surfaces under sliding microindenters sub¬ 
jected to loads similar* to those acting on abrasive particles. Other observations pertaining 
to residual stresses, magnetic property changes, microcracking and strength of machined 
ceramics provide further support for these mechanisms. For example, lapped and finely 
polished surfaces of crystalline ceramics typically have residual compressive stresses and 
high dislocation densities in shallow surface layers which support a mechanism of material 
removal by plastic cutting, and material displacement by indentation. Furthermore, such 
surfaces show little evidence of microcracking and there is very little strength degrada¬ 
tion and strength anisotropy in lapped and finely polished ceramics. In contrast, coarsely 
polished or ground surfaces of ceramics show considerable microcracking in the surface 
layers as well as strength anisotropy and strength degradation. This is consistent with a 
mechanism of material removal and wear taking place by brittle fracture. 

In conclusion, the cost effective machining of ceramics necessitates an understanding 
of the deformation processes and stress fields in ceramics produced by a sliding indenter 
under conditions of pressure and temperature prevalent in machining. By controlling and 
exploiting the transition between wear particle formation by ductile cutting and by brittle 
fracture, it should be possible to increase material removal rates and significantly improve 
the quality of the machined surface. The use of active chemical reagents to enhance or 
decrease material removal through tribochemical reactions at the interface between the 
abrasives and the ceramic workpiece should also be beneficial in this regard. 


The research was supported in part by the National Science Foundation through grants 
MSS 9057082, Jom Larsen-Basse, Program Director and DDM 9057916, Bruce Kiramer, 
Program Director. 
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?’oreword 

-t is with great pleasure that we present to the Indian engineering community this special 
-ssue of Sadhana on optimization. Optimization pervades most, if not all, engineering 
endeavours, cutting across interdisciplinary barriers. It can manifest in cost minimization, 
minimization of energy loss in a power system or delays in a communication network, 
and so on. It can be ‘static’ as in linear programming or ‘dynamic’ as in control. In all 
its multiple avatars, it remains a forever young and vibrant discipline that never ceases to 
surprise us with newer, more powerful techniques and conceptual breakthroughs. 

In its pedagogical treatments, the word ‘optimization’ has somehow come to be associ¬ 
ated with its operations research aspects. But that is only one facet of it: Error minimization 
in statistical signal processing is as much of an optimization problem as, say, a transporta¬ 
tion problem. We have taken this broader view of the subject and the wide spectmm of 
articles will reflect this. Needless to say, the choice is limited both by what can be achieved 
within the confines of a single issue and, even more so, by the biases inherent in our 
own backgrounds. Thus it represents optimization issues in operations research, electrical 
engineering and artificial intelligence. Within this context, we have tried to get a very 
representative picture of where the subject is and where it is headed. Our hope is that we 
have succeeded “not wholly or in full measure, but very substantially”. If so, a lion’s share 
of the credit goes to our contributors. We take this opportunity to express our heartfelt 
gratitude to them. We also thank our referees for their valuable time and effort. We thank 
Ms. Shashikala, Ms. Nair and their colleagues at Sadhana for the effort they have put into 
the making of this issue. Finally, we record our special thanks to Prof. N Viswanadham, 
Editor, Sadhana , who initiated the process and encouraged us all the way. 
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Abstract. We study linear-quadratic adaptive tracking problems for a spe¬ 
cial class of stochastic systems expressed in the state-space form. This is a 
long-standing problem in the control of aircraft flying through atmospheric tur¬ 
bulence. Using an ELS-based algorithm and introducing dither in the control 
law we show that the resulting control achieves optimal cost in the limit, while 
simultaneously the unknown parameters converge to their true values. 

Keywords. Stochastic adaptive control; tracking problem; ELS-based 
estimator. 


Introduction 

^re is an enormous amount of literature on stochastic adaptive control starting with the 
neering work of Astrom & Wittenmark (1973) on the self-tuning regulator. Most of the 
jarch in this area, however, concentrated on ARMAX models (Kumar 1985). Parallel 
his was the somewhat unrelated development of the design of adaptive flight control 
terns, starting with the thesis of Illiff (1973). Most researchers in this area start with 
dynamical model of aircraft in flight and, consequently, formulate their problem in the 
;e-space form. Unfortunately, the literature on stochastic adaptive control for systems 
tate-space form is rather limited. Kumar (1983) made a thorough analysis on the prob- 
i of controlling an unknown linear-Gaussian system with quadratic criterion but had to 
trict himself to the case of complete observation of the system states. We study here 
problem of controlling a linear system with incomplete and inaccurate observation 
the system states so that a quadratic tracking criterion is minimized in the situation 
en the matrix multiplying the control term in the state equation is unknown. We do not 
ume that the observation noise is Gaussian, but do restrict ourselves to the situation 
;h no state noise. This problem arises naturally in controlling the flight of an aircraft in 

tViA nhiprtivp. iR tn minimize the normal acceleration or gust 
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and pilot comfort. It corresponds to our problem when the so-called control derivatives of 
the aircraft are unknown. 


2. Problem formulation 


Consider the following discrete-time dynamical system 

( 1 ) 
( 2 ) 


Xfc+i = Axjt 4- 
yk~CxkA-Wk, 


where Xk and yj^, for fixed fc, are R"- and R'”-valued state and observation vectors respec¬ 
tively; Uk is an R^-valued control vector, {wk} is a noise sequence to be specified below, the 
matrices A and C are known, while the matrix B is unknown. Our objective is to minimize 
the tracking criterion 

J= lim sup 
N-^oo N 

where 


N-l 

Jm{u)= Yliixk - x^y Qi(xk - Xi^) + u[Q 2 Uk], ( 4 ) 

Jk=0 

with Qi >0, Qi > 0 and {x:^} a prescribed sequence of desired path. 

We follow the self-tuning approach which is based on the certainty equivalence principle. 
Thus we first determine the optimal control law assuming that B is known, and then replace 
B by an appropriate on-line estimator. Recursive estimator for B may be obtained by using 
extended least squares (ELS), maximum likelihood (ML), stochastic approximation (S A) 
or Kalman filter methods. In the specific problem considered here, the Kalman filter method 
yields readily a recursive estimator for B (Balakrishnan 1987). The estimator also has 
been proved (Balakrishnan 1987) to converge to the tme value in the mean-square sense. 
When we close the control loop the analysis becomes more complicated. Asymptotic 
optimality of the resulting control law has not been established yet. One difficulty is 
that the estimator loses the interpretation of being the conditional espectation when the 
system operates in closed loop. We propose in the next section an ELS-based method to 
estimate B. 

The rest of this section is devoted to determining the optimal control law when jB is 
known. For this, we make the following assumptions: 


Al. -h - 1 > 0 V;i e [0,27r), where a(z) = det(/ - zA) = 1 -f 

ci\z + • • • 4- a„z", z € C. This is the strict positive real (SPR) condition. Note that 
this condition implies aiz) ^0 V|zl < 1 (see Chen & Guo 1991, corollary 4.1). 

A2. (A, C) is observable. 

A3. (A, B) is controllable and (A, D) is observable, where D is any matrix satisfying 
D'D^Qi. 

A4. {ujjt, is a martingale difference sequence (mds) with 




lim sup £;[||Luji:lr 1 * 4 - 1 ] < CO a.s. tor some p > 2. 
k~^oo 


e calculations that follow are well-known (see Bagchi 1993, for example). They are 
luded only for completeness. By A3, there is a unique solution to the following algebraic 
'cati equation in the class of positive definite matrices. 


S = A'SA - A'SB{Q2 + B'SB)-^B’SA -f- Qi 

(5) 

i the matrix. 


F = A~ B{Q2 -f B’SB)~^B'SA, 

(6) 

isymptotically stable. Define 


L = -{Q2 + B'SBy'B'SA, 

(7) 

00 

bk = -'£,(.FJyQixt+j = F’bk+i - Qixl 

j=0 

(8) 

dk = -(_Ql + B'SBr^B’bk+l. 

(9) 


ing (5), we have, 

A^-l 

X^Sxi^J — JCqiSxq = ^ Xj^Sxjc^ 

k^o 
N~1 

= X] ■*" BukYSiAxk + Buk) 

k=0 

N-l 

- X] x'l^iA'SA - A'SB(Q2 + B'SB)-^B'SA -H Qi]xk, 


X](^i<2i^jk -\-UkQ2i^'k) 
k=0 

N-l 

= XqSxq — x'j^Sxi^i -f X^ (u'l^B'SBuk + Ix'j^A'SBuk 
)k=0 


4 -x;L'(e 2 + B'SB)Lxk + u',^Q2Uk) ( 10 ) 


tnilarly, we obtain 


N-l 

h'^XN - b'^xo = ^ (b[^iXk+i - b^xk) 
fc=o 


A^-i 

= X] iK+i(^^k + Buk) - b'j^xkl 


( 11 ) 


N-1 


Jt^{u)= Ylix'i^QiXk - 2xl'QiXk + x^'Qixl + u[Q2Uk) 


k=o 


N-\ 


— XqSxq — Xji^fSx/sj ^ ^ {u^B SBuk ”t~ 2x^A SBu/c 
k=o 

+ B'SB)Lxk + u'^QiUk) + 2^>oXo - 2b'^XN 

N-\ N-l 

+ ^ + Buk) - b'l^xk] + (Xk QlXk - '2-Xk QiXk)- 

k=0 k=0 

( 12 ) 


From (8) we get 

K^k + Xk QiXk = b'k^^Fxk = b'k^^Axk + b'k^^BLxk- 


(13) 


Putting this in (12) yields 

(m) — XqSxq Xjy iSx/y/ + 2 qXo — 2b ^Xf^ 

N-l 

+ y~^ {u'kB'SBuk + 2x'}^A'SB uk + u'j^Qi^k 

k=Q 

Ax'kL'(Q 2 + B'SB)Lxk + 2b'k^^Buk - 2b'k+\^Lxk + 

N-l 

— XqSxq — x^Sx/v + 2i>QXo — 2b'j^X]\) + (x^ Qi^t. ~ ^k^ ^k+i) 

jt=o 

N-l 

A Y2iuk- Lxk - dk)'{Q 2 + B'SB){uk - Lxk - dk). (14) 

^=0 


Let U be the class of admissible conhrols which will be specified below. The point to note 
at this moment is that, whatever class U we choose for admissible controls, 


'L^4Q\4-d[B'bMl (15) 

A:=0 

Thus, if in the class of desired control laws we are able to choose a control for which 
limjv-^oo sup(l/iV) J;^(u) equals the right hand side of (15), then this will automatically 
yield the optimal control desired. 


3. Recursive estimator for B 


Let 


C adj (/ - zA) = C + Ciz 4- • • ■ + C„_iz"-\ 


(16) 
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lere “adj” stands for the adjoint of a matrix and Ci are m x « matrices, i = 1,..., n — 1. 
t 


0' = [CBlCiB‘:---‘:Cn-.iBl 

(17) 


(18) 


lere C B and CiB aiQ m x p matrices. 
Let us define 



~ C ■ 


" CB - 


Cl 

and n = 

CiB 


_ Cn-l . 


_ Cn—\B _ 


mma 1. With the notations above, 

B = 

oof All we have to show is that E is of full row-rank so that E'S is invertible. Suppose 
it 

Ex: = 0, xeR”. 
len 

C adj (/ - zA)x — 0, 
lich implies that 

C(/-zA)-^x = 0. 
len for |zl sufficiently small, 

C(/ ■+• zA -f- z^A^ -f- • • - jx = 0. 
om this we conclude that 

Cx = 0, CAx = 0,..., CA"x = 0. 

tice (A, C) is observable, we must have x = 0, establishing that E is of full row 


ik. □ 

We are now in a position to propose a recursive algorithm for estimating B. We first 
opose the following scheme to estimate 6 recursively: 

0k+\ = % + ykPk(t>k[ct{z)yk+\ - O'kA - (a(z) - (20) 

Pk+\ = Pk- YkPkMkPk, (21) 

yk = (lA(fy'kPkcl>kr\ ( 22 ) 

Wk+\ = oi{z)yk+i - - (a(z) - l)^^:-^!. (23) 


here z now stands for .the unit delay operator and Pq, wq are chosen arbitrarily. 
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Let us write 4 in the block matrix form 


where 9ki are m x p matrices, i = 1, and set 


n^ = 


^kl 


L &hi 


We propose the following recursive estimator for B : 

Bk = i^'i:r^x'nk. 


(24) 


(25) 


Theorem 1. Assume that conditions Al, A2 and A4 hold. Then for any Fk-measurable 
control Uk, 




/ log>-maiLW(loglogAmm(^))'^^^ 
\ ^min iX) 


(26) 


where A.inax(fc) cind denote the maximum and minimum eigenvalues of -P/.+p 

spectively, f is as defined in A4 and 


A(y 6 - 2 ) = f ^ ^ ^ 

[ c > 1, but otherwise arbitrary, if f = 2. 


Proof Using lemma 1, we have 

B-B;fc = (£'E)-'r'(n-nt), 

SO that it is sufficient for us to show that 

||0 — 0k-\-i 11^ = O / •^niax(^))^^^ 

\ ^min(^) 

Note that, with z denoting the unit delay operator, we may rewrite (1) and (2) symbolically 
as follows: 



Xk = iI-zA) ^Buk-i, 
yk~C{I — zA) ^ Buk—\ + Wk. 

Therefore, we get 

<^(2)y;: = C adj (/ — zA)Buk~\ + a(z)wk. (28) 

Using (17) and (18), we may express (28) as 

(^(z)yk+\ — 0'(f)k + a(z)u;jt+i. (29) 

Let us denote 
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Using (23) and (29), we have 

=6^(l)k+oi(z)wk+\ - Ok+\<t>k - («(^) “ ^)^k+\ - Wk+\ 

= 0^+1# - (oi{z) - 1)4+1, 

or, equivalently, 

aU)4+i = 0^+1#, 

where 9k+\ = 6 — 4+i- Using (20) and (23) we have 

wk+i =4+1 + ^k+i = ociz)yk+\ - - l)iA'^+i 

- (4 + ykPk(l^k(oi(z)yk+i - Ol,(pk - (oiiz) - l)wk+iyyfk 
= (1 - yk(PkPk4>k)(oi(z)yk+i - (odz) - \)wk+i - d'kM 
= yk(oi{z)yk+i - (a(z) - l)n^/t+i - 0^0/:)- 
Therefore, (20) may be rewritten as 

4+1 = 4 + Pk^k(^k+\ + M^^■+l)', (31) 

or, equivalently, 

4+1 = 4 - 40/t(4+i + w^A'+i)^- (32) 

The result now follows from theorem 2 of Chen & Guo (1991) by identifying ? 7 jt in that 
theorem with, 

oi(.z)yk+i - (oiiz) - l)wk+\ - O'l^fk - Wk+\, 

which is easily seen to be -measurable. U 


4. Consistent estimator for B 


The previous theorem shows that the estimation error of Bk depends upon the behavior of 
Pjf^- In general, we do not know whether Bk converges to the true B or not. To ensure 
strong consistency of the estimator of unknown parameters and achieve optimality of the 
control law at the same time is a very difficult problem. Direct certainty-equivalence based 
adaptive contol law cannot achieve this goal in the linear-quadratic problem (Polderman 
1989). In stochastic adaptive control literature the idea of diminishing dither to the control 
law has been introduced for this purpose (Caines & Lafortune 1984; Chen 1984), which 
will be used here. 

Let {fit} be a sequence of -valued random vectors which is independent of with 
Evk = 0, Evkv'j^ = I and Huitll < constant a.s. (almost surely). Define 



Vk 



6 e 


(33) 


Without loss of generality, we may assume that Tk = cr{wi, vi, 0 < i < k}. Set 

Tf. = a{wi, 0 < i < k, Vj, 0 < j < k — 1}. 

Let be any -measurable control law at time k, obtained possibly by some cer¬ 
tainty equivalence principle. We apply the diminishingly excited version Uk of to the 
system, 


Uk = + u^k- 

Theorem!. If Al,A2,A^ hold, and if 

= 0(A:'+*), 3 6 

i = l 


1 — len 


Then 


\B-Bkf = 0 


L ’ 1 + 2 A 2 
^ log k (log log k)^^^~-^\ 


a.s. 


O' € ( -(1 4-5), 1 4-5) 


Proof We first note that the interval for a is not empty, because 


n(6 4-5)-4- - < ni^ + 


1 — 2en 


4- 


1 — 2en 1 


1 -h 2n y 2(1 4- 2n) 2 

It is easy to establish (see equation (6.72) in Chen & Guo 1991) that 
k 


1 


^ ^ ^ / as TZ —oo. 


(=1 


Then 


(34) 


(35) 


(36) 


(37) 


^max(fc) — O J 

= o + = 0(k^^h. 

Using (26) and (38), we can establish (36) provided that 


lim 

A->co 


inf^ “^min 1 #0, 


.i=0 


(38) 


(39) 


where A.niin(2f) denotes the minimum eigenvalue of a matrix X. Suppose that (39) does 
not hold. Then there exists a sequence with 
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being p-dimensional vectors, such that 




lim 


l-^oo 


equivalently, 


(=1 


h 


lim -^ l^k/ ^9-»+i) — 

£-^oo 

j = l 

theorem 2.8 of Chen & Guo (1991), 


(40) 


1 = 1 
k 


j=l 


d' 


= vy > 0, V?; > 0, 

= V/ > 0. 


nee a > (1-1- 5)/2, we then have, 

[ i=l j=2 1 = 1 

lich, using (40), leads to 

h 


—y 0, k oo. 


^ '^^Pki 0, ^ CO, 


(41) 


1=1 




>^ 1 " E(4;’")++ • • ■+4f “--+1)" 


1=1 

om (37) and (41) it follows that 

hich, together with (35), implies that 

^-Cl+J)+l-*-« ^0, i ^ oo. 

1 = 1 

quations (44) and (42) imply that 

kg “ --» 0, k 00. 

i=l 

Comparing (45) with (40) yields 


l|p)‘?ir- = i = 1,...,«. 


(43) 


(44) 


(45) 


(46) 
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Noticing that a < 1 — n(6 + 5), we find that for every i,\ Si Sn, 

a < 1 - /(6 + 5) = 1 - e - 5 - (i - 1)(6 + 5) < 1 - 6 - (/ - 1)(6 + 5). 
Therefore, (46) implies that 

^ 0, foo, / = 

This contradicts our assumption that \\pke\\ = 1 for all f. This, in turn, contradicts (40) 
establishing our result. □ 


5. Optimal adaptive control 


Let us now go back to the adaptive control problem posed in § 2. It is clear from (14) that, 
if was completely observed and [8^} was known, the optimal control would be given 
by 


Uk=Lxk+dk. (47) 

We use the ELS-based estimator Bi^ for B and define the certainty-equivalence control 
by 

where 


^k+i — + BiiU^, 

XQ arbitrary, 

Lk = -iQ2 + B'tSkBt)-'B'i,SkA, 

Sk = A'Sk-iA - A'Sk-iBkiQi + B^Sk-iBkr'B'kSk-iA+ Qu 
■S'o > 0 , otherwise arbitrary, 

dk=-{Q2 - 1 - B'j^SkBkV^Bkbk+\, 

k 

^t=-T:FLQ>4^j. 

J=0 

Fk = A-BkLk. 


We now define stopping times {cr;t} and {rjt} as follows: Set ti = 1. Let 

l;-i 

< c j’ — ni+^ 4- iij,. 


(Tjt = sup 


t > Tjt 


Tk+i =inf 


t > aic 


E ^ O' - 1 )^+^ -f il<f, Vj 6 (r^, t)\, 

l=rk 

A+S t ] 


.1+3 


l=Xk 


l=CTk 


2^ 


(49) 

(50) 

(51) 

(52) 

(53) 

(54) 

(55) 

(56) 


rhe desired control law is defined by 

^ _ P?’ for some £, 

* ~ jo, otherwise, 

I finally, the adaptive control law we are after is given by 
+ Uf . 

: U denote the class of admissible controls defined by 
U = {u\uk is Tii — measurable such that the 

resulting state satisfies Hx^H^ = o(k) a.5.} 

eorem 3. Assume that A1 - A4 hold. Then [u\}eU, 

1 1 / 
lim sup—J a^(w*)= lim sup — V [x| 
N-^OO N N — 


9 /\ogk(\og\ogk)^^^ 

|B-B,|p = 0 - 


, Q! e (1 — nc 


lof. We first show consistency of 5^. If < oo and cr^ = oo for some t, then 
=1 llw'JlP = 0{k}^^) and the strong consistency of Bk follows from theorem 2. If 
< cx) and r^+i = 00 for some i, then u\ =0 Vi > ^ and, again by theorem 2, Bk is 
ongly consistent. 

Consider now that T£ < oo, cr^ < oo for all i. Then 


1 ^ 1 ^, 
i^l n<k<at i^i 


= ®“p E E 

Ti<k<ai i=cri 


+ E ii»?ii^ + ,E 

i—a£-i i~Z£+\ 


1 r ^-1 _l+5 k 

< sup p:fj 2^]-^+ ^ 

re<k<cre ^ ^ i=T£+l 

^2+ sup ^Ell"?ll^ 

xt<k<ai «- 

<2+ sup + ll^?£ll^) 

xg<k<cr^ ^ 

<3 + ^<3 + -E<4 

To 
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Therefore, ll^J ||^ = Oik^'^^) and theorem 2 again leads to the strong consistency 
of Bk. 

By theorem 3.4 and remark 3.3 of Chen & Guo (1991), 

Sk S,k oo, 

from which it also follows that 

Fk F, \\bk - bkW 0 and \\dk - 411 ^0, koo. 

Since F is a stable matrix and Fk -> F, there exists a p6(0,1) and a C > 0 such that 

llficft-i - fill Vi:. ( 61 ) 

We now show that there exists an Iq such that 

r^o < oo and = oo. (62) 

If or^, < oo and = oo for some then uf = uf Wi > ai and from (49) it follows that 

{xit} is bounded. Hence {m^} is bounded and by definition (56) t^+i must be finite. This 
means that cr£ < oo and T£+i = oo cannot happen for any 1. 

If o-£ < 00 and ti < oo for all i, then for Z6[r£, cr£_i], 

= Axi + Bi{LiXi -F dj uf) 

= FiXiBi(di -{-uf), 

so that 

i-Xi 

•^1+1 = 44-1 •' • Fxt^xt + 44-1 •' • F'i-j+lBi-jidi-j + uf_j) 

+ Bi(di uf). 

Using this and (61) we have 

114-+111 < 114,11 + Cl ^ pj 

7=0 

= Cp'-"«+ill4,ll + C2, 

so that 

114-4-1 f < 2C/)26-t«+i)llje^j|2 2C|. 

Therefore, 

cr£ —1 

114-t-lll C3ll4£ 1!^ + C4<7£, 

i=Tf—1 


at 

Ell^ilP<C3||i„l|2 + C4<7f 


i=X( 


— o {cF^ "^ ) as ^ 00 . 


or 
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s result, along with (48), gives us 

^ = o(cr^^^) asi oo. 

s means that for sufficiently large I, 

E < -r*. 

i=Te 

ich contradicts the definition of ai, since by (55) 

/=T£ 

US we cannot have < oo and ai < oq for all 1. 

Clonsequently (62) holds and 

* Old 

iq = + Mjt’ 

1 xjc+i = Fkh + Bkdk, vfc > TgQ. 

By (61) and the boundedness of Bkdk it is clear that [xk] is also bounded and so is 
that 

k 

E\\‘‘^f = ock). 

/=] 

is and theorem 2 imply (60). 

To prove optimality, notice that 

- Xk+i = A(xk - h) + (5 - Bic)ul. 

r the boundedness of {w^}, the stability of A and the fact that B — Bk 0 we find that 

^k ~ ^k —^ 0,k CO. (63) 

lis, along with the boundedness of {xk}, implies that {xjt} is also bounded. Therefore, 
1} eU. Using (63) and the facts that Lk — L 0, and \\dk — dk\\ 0, we conclude 
at 

, N-l 

- dk)\Q 2 + B'SB){ul - Lxk - 4) 0, koo. 

^ k=0 

ambining this with (14) establishes (59). ^ 

Conclusion 

fe solved a class of stochastic adaptive control problems in the state space form which 
ise in controlling aircraft flying in gusty conditions. The important, although difficult, 
Ltensions which should be further looked into involve the state noise case and/or when 
e parameters A and C also contain unknown elements. The approach presented in this 
iper does not directly go over to this most general situation, but may possibly be used 
lere in combination with some other techniques. 
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►ntrol of a 2-DOF manipulator with a flexible forearm 
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Abstract. In this paper we present experimental results on the position and 
vibration control of the end-effector of a 2-DOF parallelogram manipulator 
with a flexible forearm. A dynamic model of the manipulator is first obtained. 
Control strategies are implemented to control the manipulator. The first control 
strategy uses the computed torque method based on a reduced-order dynamic 
model of the manipulator which is obtained by assuming that all the links are 
rigid. This method is referred to as the reduced-order computed torque (ROCT) 
method. Experimental results demonstrate that such a strategy is not good for 
vibration control of the end-effector of the manipulator. The second control 
strategy is a state feedback control law designed based on a local linearization 
of the nonlinear dynamic model. Experimental results show that this control 
strategy achieves good vibration control of the end-effector of the manipulator. 
However, since the strategy is based on local linearization, it is valid only in a 
neighbourhood of the operating point. A hybrid controller that uses the ROCT 
method for the initial large movement of the manipulator is then implemented. 
Based on a switching rule, the controller is switched to the state feedback 
controller based on the linearized model when the manipulator is sufficiently 
close to the equilibrium state. Experimental results are reported and the suc¬ 
cessful performance of the controller in dampening out end-point vibrations is 
demonstrated. 

Keywords. Two-DOF flexible manipulator; reduced-order computed torque 
control; vibration control; hybrid controller. 


Introduction 

; increasing industrial need for higher productivity requires robots that move accurately 
[ rapidly. In order to achieve this, it is desirable to have robots with light structures 
t offer high-speed performance and low energy consumption. One way of designing a 
ot with a light stmcture is to use slender Links. Modelling and control of flexible-link 
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manipulators has been a subject of active research in recent years. Flexible-link manipu¬ 
lators have the advantages of being hghtweight and have a larger pay-load to weight ratio. 
They are also easy to fabricate and their design is simple. However the fact that the links are 
flexible introduces vibrations as the robot arm is in motion. Vibration suppression and posi¬ 
tion control in flexible-link robot arms is therefore very important. A number of researchers 
have studied the control design for vibration suppression and position control of a flexible 
robot arm (see Craig 1981, Book 1984, Cannon & Schmitz 1984, Hastings & Book 1986, 
Sakawa & Matsuno 1986, Matsuno & Fukushima 1987, Siciliano & Book 1988, Wang 
& Vidyasagar 1991, Siciliano et al 1992, De Luca & Siciliano 1993, Aoustin et al 1994, 
Gross & Tomizuka 1994, Khorrami et al 1994, Moudgal et al 1994, Vandergrift et al 1994, 
Zhu et al 1994, Banavar & Dominic 1995, Lin & Yih 1996, and references therein). Suc¬ 
cessful experimental results have also been reported for single-link and two-link flexible 
robot arms (Cannon & Schmitz 1984; Hastings & Book 1986; Sakawa & Matsuno 1986; 
Matsuno & Fukushima 1987; Aoustin et al 1994; Gross & Tomizuka 1994; Khorrami 
et al 1994; Moudgal et al 1994; Banavar & Dominic 1994). Modelling of flexible robot 
arms has been carried out in many papers (see Book 1984, for e.g.). It has been pointed 
out (Siciliano & Book 1988) that, if the links are stiff, one can use singular perturbation 
methods for the control design for vibration suppression and position control in n-link 
flexible manipulators. However the control law given by Siciliano & Book (1988) requires 
measurements of the link deflection rates which is not easily measurable. The control law 
is modified as an output feedback strategy by Siciliano et al (1992) and only requires the 
joint position measurements, joint velocities, and the link deflection, which are obtained 
easily, say, from strain gauge measurements. An alternative but similar control design 
method is also proposed by Vandergrift et al (1994) for vibration suppression and position 
control of n-link flexible manipulators. A method of control design based on a dynamic 
output feedback compensator is proposed by Craig (1981), which avoids the need for the 
velocity of the deflection. Sakawa & Matsuno (1986) derived a dynamic model for a 2- 
DOF parallelogram manipulator and designed a feedback control system for it. This was 
followed up by Matsuno & Fukushima (1987) who obtained several satisfactory results 
for point-to-point position control. 

The objective of this paper is to report experimental results on the position and vi¬ 
bration control of the end-effector of a 2-DOF parallelogram manipulator with a flex¬ 
ible forearm. A dynamic model is first derived. Control strategies are implemented on 
the experimental facihty. The first control strategy uses the computed torque method, 
based on a reduced-order dynamic model of the manipulator which is obtained by as¬ 
suming that ail the manipulator hnks are rigid. This method is referred to as the reduced- 
order computed torque (ROCT) method. The second control strategy is a state feedback 
control law designed based on a local linearization of the nonlinear dynamic model 
of the system. Finally, the third control strategy uses the ROCT control for the initial 
large movement of the manipulator. The control law is then switched to the state feed¬ 
back control based on a switching mle. This method is referred to as the hybrid control 
method. 

The paper is organised as follows. Section 2 describes the experimental setup. This is 
followed by the derivation of the dynamic model of the system in § 3. Using this dynamic 
model, three different control strategies are developed. Section 4 presents the experimental 
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Figure 1. The manipulator with a flexible forearm. 

Its of the ROCT method. In § 5, the state feedback control law and hybrid control 
are discussed and the experimental results are given. The conclusions are presented 
6. 

Experimental setup 

5 section provides a brief description of the experimental setup. The objective of this 
^r is to present experimental results on the position and vibration control of the end- 
ctor of a 2-DOF manipulator with a flexible forearm. Figure 1 shows a picture of the 
lipulator in the experimental facility. 

he manipulator is a 2-DOF parallelogram design similar to the one developed by 
Is & Lokhorst (1993). The last link (forearm) of the robot is flexible. It operates in 
horizontal plane and is driven by 2 Dynaserv direct-drive brushless motors so that 
problem of friction and backlash are minimized. The characteristics of the motors 
i in the experimental robot are shown in table 1. Note that the motors have high 
brmance, high torque output which prevents saturation, and high precision encoder 
dution. The servo drivers for the motors can be preset to operate in 3 modes of 
trol, namely, position control mode, speed control mode and torque/current control 
ie. In this experiment we preset the operation of the servo drivers to the torque/current 
trol mode. Thus a voltage between —8 V to 8V applied to the servo drivers allows 
motor to produce a torque between negative maximum torque and positive maximum 



le 1. Motor characteristics. 


ure 

Unit 

Motor model 

DM1200A 

DM1045B 

;imuin output torque 

Nm 

200 

45 

;d speed 

rev/s 

1.0 

2.0 

Oder resolution 

p/rev 

1024000 

655360 

ght 

kg 

29 

9.5 

3r inertia 

kgm^ 

0.167 

0.019 
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Figure 2. Schematic diagram of an active type PSD. 


torque, and the voltage-to-torque relationship is linear. Thus direct torque control is 
achieved. 

In this experiment, measurement of the vibration at the end-effector of the robot is 
essential. The Hamamatsu C2399-00 position-sensitive detector (PSD) is used for this 
purpose. This is an active type, opto-electric position sensing unit which measures the 
position of a single-point of light spot focused on the sensor head. It uses an infrared 
LED mounted as a target at the tip of the flexible link and the movement of the LED is 
measured by a sensor camera which is mounted as shown in figure 1. Figure 2 shows the 
schematic diagram of the active type PSD used in the experiment. The advantage of the 
C2399-00 PSD is that it uses a non-discrete, silicon PSD for two-dimensional position 
sensing. The non-discrete nature of the PSD allows for high accuracy measurement of 
position. The LED is pulse-modulated and optically filtered in the sensor head with a built- 
in background-light cancellation circuit, making accurate measurements possible even in 
bright locations. Since it does not require scanning, it provides good speed of response. 
The advantages of high accuracy measurement and good speed of response make it very 
suitable for vibration measurement. The rated position resolution of the C2399-00 PSD 
is 1/5000 and its sampling rate is 312.5 Hz. The sensor head of the PSD is a camera that 
uses a C-mount lens. The choice of a suitable lens is based on the following 2 factors: (1) 
the required angle of view which is determined by the dimensions of the scene, and (2) 
the minimum illumination level of the area to be monitored. 
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Figure 3 shows the controller architecture. A quadrature encoder input card along with 
the motor encoders are used for measurement of link angular displacements. The link 
angular velocities are then obtained by numerical differentiation. Inputs to this card are 
conditioned by a 4-stage digital filter. For the experiment, the encoder signals from the 2 
motors are input to the card, which provides the digital counts to the computer. The host 
computer is an IBM PC-compatible machine with a 486DX33 microprocessor with SMB 
RAM. A program written in C-language is used to control the manipulator. A menu- 
driven control panel allows the user to input control parameters such as desired end 
position of manipulator, sampling rate, and gain constants. At each sampling time, the 
computer reads the digital input data, performs servo calculations and writes digital output 
data. 

The D/A converter for the digital computer operates in bipolar mode and has a 14- 
bit resolution and conversion time less than 42p,s. The voltage output range of the D/A 
converter is —8V +8V and its outputs are connected to the servo drivers via an emergency 
stop circuit and a custom circuit. These are used for controlling the voltages to the two 
motors. The A/D converters provide analog-to-digital conversion for the analog signal 
from the PSD. The A/D converter has a 12-bit resolution and best conversion time of 
10 |xs. The accuracy and linearity of the converter are both +/ — I bit. Besides the D/A 
and A/D converters, the digital I/O card is also used as an interface to the computer. They 
are used to read the status of motors 1 and 2 respectively. An interface to the force/torque 
sensor is also available as shown in figure 3. This is used for conducting contact force 
control experiments using the manipulator. 


3. Dynamic model of the manipulator 

3.1 Manipulator characteristics and parameters 

Figure 4 shows a schematic diagram of the manipulator with a flexible forearm. It comprises 
4 rigid links labelled from links 1 to 4. Link 4 has an extension 4 that is used to clamp 
flexible link 5. All the links are made of aluminum alloy 5083. A position-sensitive detector 


V 



4 2 


X 


Figure 4. Schematic diagram of the ma¬ 
nipulator. 





camera assemoiy cnu assemoiy 

Length (m) /” = 0.0645 4 = 0.0525 ,15 = 0.4800 

Mass (kg) _ Me = 0.209 _ mi = 0.026 _ 

(PSD) is mounted on the manipulator. The PSD consists of a LED assembly which is 
mounted at the tip of link 5 and a camera assembly mounted at a distance Ic from the joint 
of links 4 and 1. 

Table 2 shows the parameters of the camera assembly and the LED assembly. Table 3 
shows the parameters for each link of the parallelogram manipulator. 


3,2 Free transverse vibration of Bernoulli—Euler beam \ 

In order to derive the dynamic equation of the manipulator, which comprises a flexible 
forearm labelled as Link 5 in figure 4, it is essential to know its vibrational characteristics. | 

Assuming link 5 is a Bemoulli-Euler beam, its vibration is governed by the Bernoulli— : 

Euler beam equation given by 


du^ 



+ p 


d^v 


0 , 


( 1 ) 


where 0 < m < /s, p is the linear density of the beam, E is Young’s modulus and I is 
the cross-sectional area moment of inertia of link 5. The values of p and I for link 5 are 
0.244kg/m and 6.75 x 10“^ ^ m^ respectively. The assumed-modes method is used in our 
analysis. The deflection of link 5 is given in separable form as 


c» 

v{u,t) = Y](f)i(u)qi(t), (2) 

1=1 

where (f>iiu) is the ith natural mode eigenfunctions, and qi(t) is the time-dependent gen¬ 
eralized coordinate. The above equations are solved by approximating the natural modes 
of flexible fink 5 by the natural modes of a uniform clamped-free beam with boundary 
conditions given by (Craig 1981) 


Table 3. Parameters of each individual link. 



Link 1 

Link! 

Link 3 

Link 4 

Link 5 

Width (mm) 

Height (mm) 

Length (m) 

Centre of mass (m) 
Mass (kg) 

Moment of inertia 
(kg/m^) 

30 

15 

h = 0.40 

Ici = 0.058 
mi =2.905 
h = 0.079 

30 

15 

h = 0.35 
lc2 = 0.090 
m 2 = 1.505 
(2 = 0.031 

30 

15 

I3 = 0.40 
/c 3 =0.195 
m 3 = 0.877 
h = 0.023 

30 

15 

k = 0.35 
/c 4 = 0.173 
m 4 = 0.858 
h = 0.019 

3 

30 

I 5 = 0.48 

1,5 = 0.293 
ms = 0.117 

I 5 = 2.24 X 10-3 
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►le 4. Natural frequencies and related constants of link 5. 


de i 

Mode 1 

Mode 2 

Mode 3 

Mode 4 


1.87510 

4.69409 

7.85476 

10.99554 


0.734] 

1.0185 

0.9992 

1.0000 

^ 5 ) 

2.00 

-2.00 

2.00 

-2.00 

rad/s) 

69.64 

436.49 

1540.89 

3019.55 

Hz) 

' niL = 0) 

11.08 

69.47 

245.24 

480.58 


1 ^ dv 


d^v 

v|«=o = 0, — 

au 




3 natural mode shape eigenfunctions and the natural frequencies are given by (Craig 

51 ) 

01 (w) = [cosh fill — cos fiu — ai (sinh fiu — sin fiu)], (4) 

CDi=fif(Ellpf\ (5) 

ere 


1 + cos fil^ cosh fils = 0, (6) 

cos )S //5 + cosh 

Gi = -, (7) 

sm fils + sinh fils 

1 j = 1, • • • oo. 

Fable 4 shows the natural frequencies and related constants for the first 4 modes of 
k: 5. Note that the natural frequencies listed in table 4 are for link 5 without point mass 
ts tip. 


Manipulator dynamics 

e method used in the dynamic modelling of the system is the Lagrange-Euler formula- 
n. In this method, the computation of the kinetic and potential energy of the system is 
integral part of the formulation. As the detailed derivation of the dynamic equations is 
•y long and tedious, only the relevant equations are presented here. For more details on 
; derivation, please refer to appendix A. 

The full physical system comprises links 1 through 5, 2 direct drive brushless motors 
1 a camera-LED assembly mounted on the arm. The dynamic model of the system is 
^en by (please refer to appendix A) 

D(Q)Q + GQ-F H(Q, Q) + ^, 62 ) = PT^, 

Q — \fl^2^lQ2 * ■ ■ ^ 00 ] i Ttn — » 


( 8 ) 


D{Q) 


d\ + 


M 2 \ 


Mi 2 


dl + Jm2 + Ph dif 

j=l 


+miY^4,i{l^)qi 




X cqs (02 — ^l) 


X cos(02 — ^l) 


ai +mL{ls+l^)(j)\{h) a 2 + miHs + l 5 )((> 2 ih} 


7=1 

(^-^Ph->nLll<l>^ «!+mL(4+/5)^l{^5) p /5 + (/s)^ mL(pl{l5)4>2(^h) 

X cos(02 — ^1) 

^-Jp-p/l -/n/,/i02^ 012 + m lils + ls)(j>2(l5) >nL(piils)(j)2ih) p/5 + mi,02(^5)^ 

X cos(02 — 0i) 


G = 


0 0 0 

0 0 0 

0 0 phcol 

0 0 0 


0 

0 

0 

plsay^ 


A (^ 1 ,^ 2 ) = 


'Bmi6i + btni sgn{9i) ~ 


'1 0 “ 

BmlOl + bml sgn( 02 ) 


0 1 

0 

, P = 

0 0 

0 


0 0 

: 




H(Q,Q) = 

-^3 sm(02 - 0i)0^ + 4p/i02 sin(02 - V 5- Qi 

/=i 


+ 2 mLh sm(02 - 0 i)d 2 ih)qi 

1=1 


+ I —qi+mih 


oo \ 

^<l>iil5)qi j COS (02 - 0\)02 



<^3 sin(02 - 0l)0f + 2yO/502 52 22 22 ^J (^5)^y^2 

; = 1 /=] ; = 1 

{ ^ O' ^ 

- 2/0/i ^ ^ qt +/72l/i 


oo \ 

Yl^i^^5)qi I 


COS (02 — 


.•=1 /=] ; 

2 ^ 0 /i + mL/i0i(/5)^ sin (02 - 0i)0? 


plsqi +mL 


f^^<t>i(is)‘!i\ 01 (/sH e| 


(J? 


2-^p/i 4- mL/i02(^5) j sin(02 - 0i)0f 

^p/ 5^2 + WL |^22‘i^'(^5)0/j <^ 2 (^ 5 ) j 0 , 


d\ = I\ -\- in\l~Y + 73 + yn.'^l~ 2 ) + 77X4/] + 1 Tlsl\ + 772 c(j + nxjr^/j, 
d 2 = I2 ^ 7722 ^c 2 "t" -^4 + 7723/2 + 7724/“4 + ^ 7725/5 + 

+ 7725/5/5 H- 772 c/c + mi(ls + Is)'^, 

d^ = m^hlc^ - 5 7725/1/5 + 7724/i/c4 - mslsh - wl/i (4 + /s) + mchlc, 
M\2 = M21 = 2/3 cos(02 — 0l) 


+ 


2p/i X] X 

7 = 1 i=l 


sin (02 - 0i), 



'^ote that the vector denotes the vector of motor torques, Jmi, Jmi denote the rotor 
rtias of the motors which is given in table 1, jB^p Bjn 2 denote the viscous damping 
fficients of the motors, and bm^ > ^m 2 denote the friction coefficients of the motors. The 
aes for all the other parameters in the dynamic model (8) may be obtained from the 
a in tables 2, 3 and 4. 

n(8), D{Q) is the inertia matrix, G{Q) is the stiffoess matrix, H{Q, Q) is a matrix con- 
ling nonlinear centrifugal and coriolis terms, and ju,(0i, 02 ) is the friction and damping 
trix. It should be noted that coi in the stiffness matrix refers to the free natural fre- 
mcy of vibration of a clamped-free uniform Bemoulli-Euler beam. It does not indicate 
frequency of vibration of flexible link 5. The Rayleigh’s method for approximating the 
idamental frequency of a continuous system is presented in § 4. This method will be 
id to estimate the frequency of vibration of link 5. 


4. Control design based on reduced-order computed torque method 


In this section, a controller is designed based on a reduced-order dynamic model of the 
manipulator using the computer torque control design methodology. This method is re- 
fen'ed to as the reduced-order computed torque (ROCT) method. For control of a fully 
rigid manipulator arm, it is only necessary to control the rigid modes of the manipulator 
and the computed torque method can be used effectively for this purpose. An attempt to 
use the same model-based control law in a manipulator with link flexibility may result 
in deterioration in perfoiTnance because the controller has been designed to only control 
the rigid modes but not the flexible modes of the manipulator. The ROCT method may be 
used to control a flexible-link manipulator with minimal deterioration in perfonnance, if 
the discarded terms in the model-based control law have only a negligible effect on the 
dynamics of the system. For example, friction, viscous damping or flexibility in the system 
may be ignored if the system is well lubricated or is fairly rigid. 

The objective of this experiment is to study the deterioration in performance, if any, of the 
controller designed based on the ROCT method. The physical system is described by the 
matrix equation given by (8). Assuming link 5 is rigid, q\, qi, ■ • ■, ^oo and qi,q 2 ^ ■ ■ ■ ^ Qoo 
are set equal to zero in (8) so that the resulting equation is the dynamic model of an 
equivalent ‘rigid’ manipulator. This equation is given by 


-1 

_1 


\j-m2. 



+ Jml 


d^ cos(02 — 0l) 


^2 + Jm2 

.^2_ 


Bml 0 

\0i] 

4- 

■^;„isgn(0i)‘ 

0 B„2\ 

A. 


>m2Sgn(02). 


—d^ sin (02 ~ ^l )^2 
d-i sin (02 — 

(9) 


Based on the reduced-order dynamic model given by (9) and ignoring the coulomb friction 
and viscous damping terms which are not known exactly, a control law is obtained based 
on the computed torque method which is given by 


Twi2. 


M(0i,02) 


^}d + kp\e\ -f kv\e\ + ki\ Jq e\(^t 

Qld + kp2e2 + ^v2^2 + ki2 Jq e2dt 


+ h(9u02JiJ2) 

( 10 ) 


where 


^ (01, 02) = 


Jm\ 

<^3 COS(02 — 0l) 


d-T, COS(02 — 01) 
d2 + Jm2 


h(9i, 02, 01, 02) = 


-dz sin (02 - 0i)0j 
£/3sin(02 - 0i)0f 


^1 =9id - 01 


^2 = 02d — 02 


The control law given by (10) is implemented to study position and vibration control of 
the end effector of the manipulator. Figure 5 shows the controller based on ROCT method 
which has been implemented on the system. Figure 6 shows the experimental results of 
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Fieure 6 . Experimental results for ROCT. 









Figure 7. Relative magnitude of (/ 5 )(?i (0 and <p 2 ih)^ 2 {t)- 


the ROCT method. Note that the desired state of the system is defined as 9{ = 2.65rad, 
62 = 3.99rad. The gains are set to Kp\ = 100, Kv\ = \ and Kp 2 = 50, Kv 2 — 4.25. 

Figure 6 shows that there is deterioration in the performance of the controller based 
on the ROCT method. Although the controller is able to control the rigid body modes of 
the manipulator, it is unable to control the vibratory modes of flexible link 5. Thus it is 
necessary to use a different control law for position and vibration control of the manipulator. 

Rayleigh’s method for approximation of the fundamental frequency of vibration is used 
to estimate the first fundamental frequency of vibration. Rayleigh’s quotient is given by 


CO; = 





1/2 


( 11 ) 


Since £/0j'(u)^du = cofpls, p(j)i{u)'^du = pis and all the other parameters 
are known, the first fundamental frequency of vibration can be estimated. The parameter 
values were substituted into (11) and (o\ was found to be 8.09 Hz. From figure 6 , it is 
observed that the natural frequency of vibration is about 7-8 Hz, which is very close to <z)i. 
Hence, it can be deduced that, for the experimental system, the dominant mode of vibration 
is mode 1. This can also be verified by checking the relative magnitude of < 5^1 il 5 )qi it) and 
(p 2 {l 5 ) 92 it). Figure 7 shows the relative magnitude of 0i {l 5 )qi (t) and 02 (^ 5)^2 (t) obtained 
from simulation. From the plots in figure 7, it is obvious that 0i (l 5 )q\(t) is large compared 
to (t> 2 {h)^ 2 it). The deflection at the tip of link 5 can thus be approximated by 


u(^5,O = 0i(/5)9i(O- 


( 12 ) 


This is because mode 1 is the dominant mode of vibration and the effects of mode 2 and 
other higher order modes are negligible and can be discarded. 
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Control design based on linear state feedback method 

controller based on the reduced-order computed torque (ROCT) method has been pre- 
ed in the previous section. In this section a control strategy based on local linearization 
le nonlinear dynamic model of the physical system and linear state feedback control 
hod is presented. Then another control strategy, referred to as the hybrid controller, 
eh combines the positive characteristics of both the ROCT controller and state feedback 
trol law is presented. 

.ecall that the matrix representation that describes the dynamics of the experimental 
em is given by 

D{Q)Q FGQ + H(Q, Q) + /r(0i, ^ 2 ) = PTm- (13) 

as been deduced in § 4 that the dominant mode of vibration for the experimental sys- 
is mode 1. The effect of mode 2 and other higher order modes is negligible so that 
' may be ignored. In general, for other systems, more than one mode may be required 
live a reasonable approximation for vibration. When more than one mode is used, 
all states are measurable based on measurements from the encoders and the PSD. 
refore an observer must be constructed so as to estimate the unmeasurable states. In 
1 a case, the design of the controller becomes more complex because if the system 
f nth order, the observer is also of nth order for a full order state observer. The re- 
ing closed loop system becomes of order 2n; that is the complexity and size of the 
alem is doubled. Furthermore, some deterioration in performance may be observed 
le dynamic response of the observer is not good. If only a 1-mode approximation is 
d, it eliminates the need for an observer which is clear from (12). For the experiraen- 
5ystem, a controller based on this model is implemented and satisfactory results are 
lined. 

)efine the state vector to be jc = [6i92Cii0i92q]^ and the equilibrium state to be xq = 
920000]^, where 9^ and 9^ are the desired joint angles. The linearized equations 
he experimental system with 1-mode approximation for the flexibilty is defined as 
ows. 


8x = A8x -)- B8u, 

8x = [ 50 ] 502 ^01 ^^2 ^ 01 ]^5 

A^i ^ A 

V-o(e)io‘G oj’ [dcoio*/ 

a 3 X 3 null matrix, / is a 3 x 3 identity matrix. 


8u — 8Tm — [T^ml^m2] 


P = 


nT 


and 


Dmo = 

^ d\ + J,fii d-T, cos{92 — 0^) - mL/i0i(/5))'^ 

x(^o _ ^0) 

d 3 cos(02 — d2 + Jm2 Oti + + l5)4>lih) 

(-|p-/0^1 " Cii + miils + l5)<l>lQ5) ph 

^ X COS(02 — / 

The state feedback controller for the linearized system is implemented and experimental 
results are obtained. The control law is given by 


^2 


-el 


^m\ 

Jml, 


= -K 


<?i 

^1 


0 i 


L qi J 


The feedback gain matrix is chosen to be 

[34.63 -0.58 16.50 6.33 0.01 1.01 ‘ 

0.29 7.07 -7.56 0.14 1.88 -0.64 


which gives the following closed loop eigenvalues for the system as 


e = 


"-0.92±53.07r 
-6.30±5.43i 
_-4.00±3.75i _ 


Figure 8 shows the experimental results of this controller, in which the desired state of the 
system is defined as 

6l?=2.65rad, = 3.99rad, = 0, 0? = 0, ^[ = 0, 4? = 0. 


Comparing figures 8 and 6, it is clear that slower response is observed for the rigid body 
modes using the state feedback control based on the linearized state space model in com¬ 
parison to that obtained by the ROCT method. However, vibration at the tip of link 5 is 
suppressed after a very short time. The experimental results show the effect of the state 
feedback controller designed based on the linearized system obtained by a local lineariza¬ 
tion of the nonlinear dynamic model, given in (13), about an equilibrium state. When 
nonlinearities in the system are not severe, local linearization may be used as an approx¬ 
imation to the nonlinear model. However, such manipulator control is not well suited for 
the manipulator making a large movement. An alternative is to use gain scheduling, in 
which the controller gain matrix and the equilibrium state of the manipulator change with 
the manipulator as it moves. 
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Figure 8. Experimental results for feedback 
control system based on linearized state space 
model. 


hybrid controller is proposed as another alternative to control the flexible-link manipu- 
The hybrid controller comprises the controller based on the reduced-order computed 
le (ROCT) method for the initial large movement of the manipulator, i.e, from the 
.1 state to a state sufficiently close to the desired state. When the manipulator is close 
j desired state, the controller based on the ROCT method is switched to the linear state 
)ack controller. A switching rule is used for such purpose. Figure 9 shows the exper- 
tal results for the hybrid controller. The initial state and desired state of the system is 
lentical to that defined in ROCT and state feedback control. The same gains are also 
. The switching rule is to switch from ROCT controller to state feedback controller 
i both 6>i and 62 are within 10° from the desired angles. 

le experimental results show that there is significant improvement in the performance 
e controller in terms of speed of response of the angular positions and vibration sup- 
ion at the tip of the manipulator. The use of the ROCT controller for the initial large 
iment of the manipulator enables fast response for the angular position. When the ma- 
[ator moves to a state sufficiently close to die desired state, where the local linearization 
e nonlinear system is valid, the controller is switched to state feedback control. This 
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time (s) 


controller. 


6. Conclusion 

In this paper, experimental results on the position and vibration control of the end-effector 
of a 2-DOF manipulator with a flexible forearm have been presented. The dynamic model 
is obtained using the Lagrange Euler formulation and by modelling the vibration using 
the assumed-mode method. Control strategies are implemented to control the manipulator. 
The first control strategy uses the computed torque method based on a reduced-order 
dynanuc model of the manipulator obtained under the assumption that all the links are 
rigid. This method has been referred to as the reduced-order computed torque (ROCT) 
method. Experimental results show that the ROCT controller is not suitable for position 
and vibration control. However, the ROCT method is good in controlling the rigid modes 
(angular positions) of the manipulator. The second control strategy is a state feedback 
control law designed based on a local linearization of the nonlinear dynamic model about 
an equilibrium state. Experimental results show that the performance of the state feedback 
control law is good and the vibration at the end-effector of the manipulator is damped out 
effectively. However this controller is valid only for small movement of the manipulator 
about the equilibrium state because linearization is done locally. The third control strategy 
combines the positive characteristics of each of the two controllers described above. This 
controller, referred to as the hybrid contioUer, uses the ROCT method for the initial large 







movement of the manipulator. Based on a switching rule, the controller is switched to the 
state feedback controller when the manipulator is sufficiently close to the equilibrium state. 
Experimental results show good speed of response and effective vibration suppression at 
» the end-effector of the manipulator. We mention that a one-mode approximation of the 
flexible modes was used in our experiments in control of the 2-DOF manipulator. If more 
than one mode of vibration is dominant in the system, then for vibration control one must 
design a state estimator for estimating the flexible modes of the manipulator based on the 
deflection measurements obtained using the PSD. 

Appendix A. 

In this section we provide the detailed derivation of the equations of motion for the 2- 
DOF manipulator with a flexible forearm. The full dynamic model of the system includes 
modelling of manipulator, modelling of point masses and modelling of the motors. The 
5 following assumptions are made on the flexible link 5 (see figure 1), in the derivation of 
the dynamic equations of the manipulator: 

(1) Link 5 is linearly elastic and vibrates horizontally. 

(2) Shear deformation and rotary inertia of link 5 are negligible. 

(3) Deflection of link 5 is small and change in length of the beam is negligible. 








potential energy of link 5. 

Lagrange’s equation is given by 

d /dT\ dT dU 

^ -T- + 7 -= ^ = 1,2, 3,..., 00 , (Al) 

dt \dzkJ ozk ^Zk 

where zk is the generalized coordinate and Fk is the generalized force. The velocity of an 
arbitrary point R on link 5 (see figure 1) is given by 


-l\ sin 9i (4 + u) sin $2 
1 1 cos 61 —{Is 4- u) cos 62 


' _L r ^ 

. [Oli - 


sin02(9i^/9O + VO 2 COS 62 
-cos 92 (dv/dt) + D 02 sin 02 . 


The kinetic energy of links 1 to 5 are given by 
Ti = ^h9^ + ^mil^i9l 

T 2 = 2 / 2^2 + k^2lc2^h 

T 3 = + ^2^2 + 2/2/c 3^1^2 COS(02 — ^l)]> 

74 = \U9l + \m4[lj9^ + 4^1 + 2hlcAB\02 cos(02 - ^ 1 )], 

^5 = -^ liL + ul/m 

1 / . . dv • 

= -p / I -h9i sin 9i + (4 4 - u)9 sin P 2 4- sin 92 — + v 92 cos 92 

2 Jo \ dt 


+ cos 9i — (Is 4- u )92 cos 92 — cos ^2 -^ + v 92 sin 92 J du. 
Since link 5 is a uniform beam, 

ms = pis, 

rh ^ rh ^ 1 _ 


h = udm ~ J u^pdu = -msly 


Moreover, v(u,t) = X!£i 0/(w)9i(0. therefore by expansion and simplification we get 


^242 1 Irr. j ^.i2 


Ts — -msliOi + -[4 + msls + mslshWi 

1 . . 

-2^5Uih + 2lils]9i92Cos(92 - 9i) 

J -/j 00 00 

“^2^ Jo 




4- 2 —l\9\ cos(02 - ^i) (f>i{u)qi(t) 
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+ h6]^sin(e2 - 6\)'Y^4>i(u)qi(t) 


+2 fc+,«)e2E <l>i{u)qi{t) du. (A6) 

/=l 

In the experiment, a camera-LED assembly is mounted on the manipulator. The effect of 
the LED assembly, mounted at the tip of flexible link 5, is to decrease the natural frequency 
of vibration of link 5. For modelling accuracy, the camera and LED are modelled as point 
masses. The following kinematic relationships are used in the derivation. 

'xcl _ [h cos 6i + Ic cos 62' 

ycj ~ LA sin^i +/csin02 J ’ 

"icl _ r —^lA sin0i02Asin02 
jcj "" L^iA cos^i + 02Acos02j ’ 

'Xe~\ _ pi cos 61 — (A + I5) cos 02 + sin 02~ 

_ye_ ~~ Ji sin^i — (A + 15) sin ^2 — cos ^ 2 ] ’ 

~xf _ sin^i+^ 2 (A + A)sin 02 + i^(A)^ 2 COS 02 + ^^(A)sin 02 
_ye_ A cos 01 — 02(A + A) cos 02 4 - i’(A)^2 sin 02 — ^(/s) cos 02 J ’ 

Note that (jc^, yc) refers to the location of the camera and (xg, ye) refers to the location 
of the LED, which is at the tip of link 5 (see figure 1). The kinetic energy of the sensor 
camera is given by 

Tc^^mdXc +yc) 

= \mc[l^e} + I'^e^ + 2AA^i^2COs( 02 - 0l)]- (A7) 

The kinetic energy of the LED is given by 
Tled = +yg). 

This can be simplified as 

j “ 00 

Tled = ^mi i\el + (h + hfel + Bl E <l>i (') 

^ L j=i 

00 00 00 ~ 

j=\ i=l J=1 

— miliQs + / 5 ) 0102 COS (02 — 0 l)+f^L 
00 

X h sin (02 - O\) 0 i 62 Y^(l>i{l 5 )QiiO 


+ (A 4- A)^2 ^ (A)4i (0 


4- mi -h cos(02 - 0i)0i ^<?!>i(A)<7i(0 


^ iO 


I\un J.ULK Ljyc CL UL 


From (A2)-(A8), the total kinetic energy of the system is given by 
5 

T = YlTi + T, + Tled- 


/ = 1 


The eigenfunctions satisfy two orthogonality conditions (Hastings & Book 1986). The 
orthogonality conditions given by Hastings & Book (1986) are used to simplify the above 
expression for kinetic energy of the system. The orthogonality conditions are 


p^i(f)jdu 


f 




\ 0, if i ^ j\ 

pis, if i = }, 

[ 0 , 

[(ofpis, if i = j. 
where 0; (u) is the assumed-mode defined by (4). Let 

^1 = + ^3 + + m 4 lf mslf + rucif + milf, 

dl — h-^ + A + m'slf rriAlf^ -f \msl\ + mslf + mshh 

-hmclf +mLils -{-hf, 

ds m'ihlc'i — \msl\ls + m^ldcA - mshh — mihijs + h) + mchlc- 
It can be shown that 

'h 


and 


^ u(j)i{u)du^ 2 /fi'f, 

f (^i(u)du= f coshfiiu — cos fiiu — cr/ (sinh ^iu — sin ^iu)du 
Jo Jo 


— 2cjj / . 


f Qs + m)0j («)dM = 2(4 ^ -f ^ j = ^ , where at = 2p (is ^ ) • 

Jo \ Pi Pf/ P \ Pi Pfj 

Using the orthogonality conditions and the relations mentioned above, the total kinetic 
energy of the system can be simphfied as 

T = -d\9f -b -jdjOf 4- ^3^102 cos(02 — ^l) + -/ 0/5 qf{t) 

i—\ 

1 00 °° / O'- 

+ - 2pl\9\ cos (02 -9\)Y\^i 

i=i i=i ^ P^ 

^ / O'- \ ^ 1 

2ph9i62Sm(e2 - ei)Y(qi(0^ ) +02y'(4/(O«i) + 

/=1^ Pi'' i=l 2 


^2 £<^;(/5)^;(0+ ^^ 5 ) 47(0 

. '=1 7=1 i=l 7=1 


+ 


X 



+ mL 


+ mi 


ll sin (^2 —^l)^ 1 ^ 2 ^(^ 5)^1 (0 + Gs-^hW2^4>i(h)^i(t) 

i=l 


i=l 

00 


-h cos (^2 - ^1)^1 Yl(l)i(h)qi(t) 

i=\ 


(A9) 


:e the manipulator moves in a horizontal plane, the total potential energy consists of 
/ the elastic potential energy of link 5. The potential energy of the system is given by 

2 


U 


U 


1 / 9 ^ 1 ; \ 

= - / Eli -I 

2 Jo \du?-j 

^IeiI 
2 Jo 


du, 


^ (l>i(u)qi (t) ^ (t)j(u)qj (0 

/=l 7=1 


dw, 




(AlO) 


t = l 


ng the kinetic energy and potential energy expressions (A9) and (AlO), the Lagrange 
ation given by (Al) can now be used to find the equations of motion of the manipulator. 

generalized coordinate, z\ ~ 0\ 

d\9\ + ^3 cos(02 ~ ^ 1)^2 ~ ^3 sin(02 ~ ^ 1)^2 

-2pl\ cos(e2 - ^i) y'9i(0-3^ + ‘iph6^s^a{B^ - 

ft Pi 

00 

+ 2 p/i [02 sin (02 ~ ^ 1 ) + ^ 2 ^ 08(02 — 

j=l 

00 

sin(02 — 0i) ^0/(^5)0'/^2 + 2mL/i sin(02 — 0i)02 X/0j(^5)0j 
/=! »=l 

00 00 

-{■mill cos (02 — 0l) 'y^4>i{h)qi^2 ~ cos (02 — 0l) ^0/(^5)^/ = 

i=l 

(All) 

• generalized coordinate, zi = 02 

oo 

^202 + d^ cos(02 — 0 i) 0 i + d^ sin(02 ~ 0 i)^^ ~t“ T "! Qi (Ooff 

1=1 

°° O'" °° 

+ 2p/i [01 sin (02 - 01 ) - 0f cos (02 - ^i)] (0“ + P^5^2 X] 

i=l 

00 oo oo 

4 - 2 p /502 ^ qi {t)qi it) + mz, X] 0i (^5)^i X! (^ 5 ) 07^2 

i=l i=l 7=i 

CO 00 ^ 

H-2mL X]0i(/s)^! + mill sin(02 — 0i) y^,0t(^5)0t^i 
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-mih cos (02 - 0\)Y^(t)i{ls)qi6^ + miijs + /s) (^5)^j = ''^ 2 - 


/=l 


i=\ 


Fo7 ‘ generalized coordinate, Zk-\-2 = qk (k = 1,2, 3,... oo) 

—-^ph cos(02 - &i)d\ + oqOi + phqiit) + ph{o^} - 02)qi(t) 

Pi 

20” 

- '^'PhO] sin (02 - 0l) + + 02(^5)<?2) 

Pi 

+ mi,(ls + l5)4>iQ'5)^l — tnih cos (02 — ^l)0/(^5)^l 

-mih sin (02 - ^l)0i(/5)^f - mL(<^l(/5)<?l + 4>2{h)q2)^ih)0l = 0 . 

The manipulator is actuated by 2 motors and therefore the motor dynamics must be mod¬ 
elled as well. The 2 motors used in this experiment are direct drive brush-less motors. 
Assuming only friction and viscous damping are present, the dynamic equations of the 2 
motors can be written as 


/fnl 0 

r?'i 

_i_ 

Pml 

0 ■ 

pl 1 

1 

■Z?;„isgn( 0 i)‘ 



_ 0 Jm2_ 

.^2. 


_ 0 

Pm2_ 

A. 


>;„ 2 Sgn( 02 ). 


J-m2_ 


ri 

T 2 J ’ 

(A12) 


where 3^2 are the rotor inertia of the motors, viscous damping 

coefficients, ^^p ^m 2 are the friction coefficients, and r^p tm 2 the motor torques. 
From (All) and (A12), the full dynamic model of the system can be obtained. This is 
given by 

D(Q)Q + G(2 + H{Q, Q) + niOx, 02) = PTm, 

Q = {0\02q\q2 • • • ^oo]^, = [r;„iT^2]^, 


D{Q) = 
r ^1 + -^ml 


M21 


^12 (^"Tp-pfi 

X cos(^2 “ ^ 1 ) ^ cos(02 

CO 

^2 + Li2 + Ph 

1=1 

“1 miils + l^)4>\il^) 0i2+mi{ls +ls)<t>2{h) 


X cos (^2 -^]) 

X 008(^2 ~®l) 


+mi 

i=l 

00 

j=l 

“2 +'”Ldi 4-/5)^(/s) 


pl^ + 


wiZ,</»ld5)<?^2d5) 


p/5 +m/.(/) 2 d 5 )^ 
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0 0 0 0 

0 0 0 0 

0 0 plso)\ 0 

0 0 0 plsa)2 


1 ^01,02) 


'BmlOi+bml sgn(0i) 


-1 0 ' 

Bm202 + bm2 Sgn(02) 


0 1 

0 

, P = 

0 0 

0 


0 0 
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-• • - 


H(Q,Q) = 


-d2 sin(02 - 0i)^2 + ^PhOl sin(02 " ^l) XIT 

i=\ 

CX5 

^Imih sin (02 - 9\)02Y^M5)m 


1=1 


+ I 2p/i X ^ + f^LhYl4>iih)^i I cos(02 - 0i)02 


i2 


i=l 
oo 


00 


sin (02 — 0l)0f + IplfiOl X X X 

i=i 1=1 ;=i 


2pl\ ^ ^ ) cos (02 - 0l)0f 


/=1 


- (2^pli + miliMs)^ sin (02 - 0i)0f 


- Phq\ + fni 


oo \ 


1=1 


01 (/5) Ol 


- ^2^p/i + mLh4>2(h)j sin (02 - 0i)0f 


- Phqi + mi 


oo \ 


02(^5) I ^2 


i=l 


di = I\ mil^Y + /3 + W3^c3 ^4^1 "i" ^5^1 ■1" ^cl\ "i" » 

d2 = I 2 + " 12/^2 + ^^4 + ^3^2 ^4^c4 5 ^5^5 

-^-mslsh + ruclc + miils + 
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d3 =^ 3 / 2/03 - l^Jnshls -\-m4\lcA -^54^1 -mihih +h) + mc44, 
Mi2 = M21 = ds cos(02 — ^1) 


+ 


Y1 T. + ^LhYl^i^^5)qi 

i=l i=l 


sin (02 - ^ 1 ). 
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le actor-critic algorithm as muiti-time-scale stochastic 
proximation 
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Bangalore 560012, India 

Abstract. The actor-critic algorithm of Barto and others for simulation-based 
optimization of Markov decision processes is cast as a two time scale stochastic 
approximation. Convergence analysis, approximation issues and an example are 
studied. 

Keywords. Actor-critic algorithm; stochastic approximation; Markov deci¬ 
sion processes; simulation-based algorithms; policy iteration. 

Introduction 

rkov Decision Processes (MDPs) have been a popular paradigm for sequential decision 
dng under uncertainty. The traditional approach has been to write down the dynamic 
gramming equations appropriate for the problem at hand. Their solution yields the 
called value function for the problem. The optimal policy as a function of the state is 
1 prescribed as the minimizer of a certain ‘Hamiltonian’ defined in terms of the value 
ction. This, in fact, gives a complete characterization of optimal (stationary) policies, 
ce everything hinges on computation of the value function, several iterative algorithms 
e been proposed for the same. These fall broadly into two classes ; value iteration and 
icy iteration. An extensive account of these and related developments can be found in 
erman (1994). 

iVhen applied to real problems, however, this scheme often runs into difficulties. The 
St notorious, of course, is the curse of dimensionality, caused by the typically very large 
5 of the state space. An equally (if not more) difficult issue stems from the fact that the 
oretical analysis of MDPs presupposes exact knowledge of underlying stochastic dy- 
nics. Translated into real terms, this calls for accurate model selection and identification 
die relevant parameters. Though this can in principle be a separate, off-line statistical 
‘rcise, the computational overheads can be considerable. 

Phis problem has been brought to the fore forcefully by some emerging apphcations 
artificial intelligence, e.g., in game playing machines and robotics (Barto et al 1995; 
erthi & Ravindran 1994). Here complexity of exact modeling and analysis is very high. 
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but, mercifuily, that of simulation is often not so. To press this point further, consider a 
large interconnected system like a communication network. The dynamics evolves as per 
simple local update rules operating at individual constituent units and therefore is quite 
amenable to simulation on a parallel machine. But the overall dynamics can be extremely 
hard to analyse. This has prompted simulation-based algorithms which are akin to the 
traditional algorithms for computing the value function, but with one crucial difference: 
one replaces a computation using an exactly known transition probability function by an 
actual simulated transition as per the random mechanism that determines it. The algorithm 
is expected to ‘see’ the actual transition mechanism through an averaging affect. Mathe¬ 
matically speaking, these involve a marriage between the traditional algorithms for MDPs 
and stochastic approximation, a procedure in statistics with a long and illustrious history 
(see Benveniste et al 1990, for a comprehensive account of the latter). 

These algorithms again fall into two broad classes. The first is the Q-leaming algorithm 
of Watkins (1989) which has been extensively analysed (Watkins & Dayan 1992; Jaakola 
et al 1994; Tsitsiklis 1994). This can be viewed as a stochastic approximation version of 
the value iteration. The other strand consists of the actor-critic algorithm of Barto et al 
(1983) and its variants which may be viewed as stochastic approximation counterpart 
of policy iteration. Though some mathematical analysis is available in this case as well 
(Williams & Baird 1990), the situation is a little harder than Q-leaming because of a 
certain inherent problem in policy iteration. In policy iteration, each update requires the 
computation of ‘cost to go’ function or ‘value’ function for a fixed policy. This in itself 
requires a ‘value iteration’ sandwiched between two policy updates, albeit a linear one 
(because the policy is fixed). One may view this as an algorithm with two loops, the inner 
one performing the linear value iteration for a fixed policy and the outer one updating the 
policy at the end of it. In practice, of course, the update of the outer loop cannot be kept 
waiting forever while the inner loop algorithm converges asymptotically to the desired 
cost to go function. Various ad hoc schemes have been proposed to ensure reasonable 
behaviour (Barto et al 1995). Our aim here is to propose a variant of the actor—critic 
algorithm based on some recent results on stochastic approximation with two time scales 
(Borkar 1996). The idea here is to operate the inner and outer loops with different step- 
size schedules, so that the inner loop moves on a faster effective time scale than the 
outer loop. This ensures that while the inner loop sees the current policy in the outer 
loop as quasi-static, the outer loop sees the value iteration of the inner loop as essentially 
equilibrated. This provides an actor-critic scheme that is asymptotically exact, at least in 
principle. 

Having said all this, we should hasten to add that the simulation-based algorithms are 
not without problems. The first major problem is that many of these simulations call 
for a parallel, distributed implementation of the algorithm. This throws up issues like 
asynchronism and interprocessor communication delays. It is well-known that even simple, 
innocuous iterations that perform ideally in a centralised implementation can go hay¬ 
wire in a parallel, distributed environment (Chazan & Miranker 1969). Fortunately, a 
considerable body of work is now available on conditions under which one may still get 
the desired convergence (Bertsekas & Tsitsiklis 1989; Borkar 1994). We apply these ideas 
to the present algorithm to underscore conditions under which the desired convergence is 
preserved in a parallel distributed set-up. 
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rhe second problem is the familiar ‘curse of dimensionality’ which looms much larger 
simulation-based algorithms. This is because they involve iteration of vectors indexed 
state and action, not state alone, which increases the dimensionality many-fold (not 
mention that the algorithm perforce is restricted to finite state and action space case), 
us usually the algorithm must be accompanied by an approximation scheme to make it 
:table. The traditional approach would be state aggregation whereby one clubs parts of 
te space into single meta-states. An alternative approach gaining currency is to approxi- 
te the value function directly (Schweitzer & Seidman 1985; Tsitsiklis & Van Roy 1996). 
is is appealing in view of the recently established function approximation properties of 
iral networks and good algorithms for neural network training that can be exploited here, 
vertheless, certain counterexamples in literature (Tsitsiklis & Van Roy 1996) suggest 
t the approach is not without its problems. We shall discuss these issues later in this 
Der. 

The paper is organised as follows. The next section briefly reviews the MDP paradigm for 
counted cost problem and describes our variant of actor-critic algorithm associated with 
Section 3 recalls some key results concerning stochastic approximation, from Borkar 
)96) and Borkar (1994) respectively. These are used in § 4 for the convergence analysis of 
; algorithm and its asynchronous version. Section 5 describes an approximation scheme 
5ed on state aggregation. Section 6 presents a simulation example. Section 7 concludes 
th a brief discussion of further research issues. 

This paper derives much from Borkar (1994; 1996) in terms of technique. Because of 
s, we have opted in favour of giving sketches of proofs instead of complete details, 
ving the latter would require reproducing the aforementioned references in totality, a 
nsiderable overhead in terms of length and mathematical abstraction. Needless to say, 

; have pointed out the minor variations as and when needed. Complete mathematical 
tails can be found in Rao (1996). 

MDPs and the actor-critic algorithm 

e begin by recapitulating some well-known facts about MDPs. Puterman (1994) is a 
od general reference for the material. 

Let S ~ {1, 2,. .., A = {<20, < 31 ,..., < 2 r} be prescribed finite sets and p : S x S x 
[0,1] a map satisfying 

p(/, j, a) e [0,1], k,a) = I Vi, j, a. 

k 

1 MDP (equivalently, a controlled Markov Chain) on state space 5, with action space 
and transition probability function p(•,-,•), is an 5-valued random process Xn,n >0, 
tisfying 

P(X„+i = j/Xk, Zk, k <n) = piXn, j, Zn) V« > 0, 

tiere {Zn} is an A-valued ‘control’ process. If {Zn} is of the form Zn = '^{Xn), n > 0, 
r some map u: 5 A, we call {Z„}, or, by abuse of terminology, the map v it- 
If, a stationary policy. More generally, if for each n, Z,i is conditionally independent 
’ Xm, Zm,rn < n, given Zn. we call it a stationary randomised policy and identify 


it with the map (p:S-^ V(A) (V(- • •) = the space of probability vectors on • •’) 
which gives the conditional law of given Xn. For i e S,a e A, let n(i, a) de¬ 
note the < 2 th component of (pii). Under a stationary policy v (resp., a stationary ran¬ 
domised policy (p), {Xn } is a time-homogeneous Markov chain with transition probabilities 
[[p(z, j, v(i))]] (resp., y, (/5(0)]] where ^(i, y, (p{i)) = Eag/i h a)Tc{i, a)). By 
a further abuse of terminology, we identify the stationary randomised policy (p with 
the vector n = {K{i,a)], where the elements are ordered lexicographically. Note also 
that the class of stationary randomised policies contains the class of stationaiy policies, 
since the latter correspond to the case when each (pii) is concentrated at a single point 
in A. 

We shall consider the infinite horizon discounted cost control problem. In this a discount 
factor a e(0,l) and a mnning cost function : 5 x ri —^ are prescribed and the aim is 
to minimize over all admissible {Z„] the quantity 


n=0 

Define the ‘value function’ V : S R by: For i e S, 


V(i) = min E 


OQ 


Y^a"k(X„.Z„)/Xo = i 

n=0 


the minimum being over all admissible {Zn}. It is known that V is the unique solution to 
the dynamic programming equations 


V(0 = min 

a 


kii, a) A-a Y^piiJ,a)Vij) 
j 


i G S. 


Furthermore, any v : S R satisfying: vii) attains the minimum in the rhs of the above, 
yields an optimal stationary policy, optimal for all initial conditions. In fact, {Z«} is optimal 
if and only if with probability one. 


Z„ G argmin(fc(X„,.) + a^/?(X„,y, OFCy)). 

j 

The key to the control problem therefore lies in finding V (•)• Two standard approaches 
for this are value iteration and policy iteration. (A third approach to MDPs reduces them to 
linear programming problems. We do not consider this approach here.) The value iteration 
starts with an initial guess Vq ^rid iterates as per 


Fn+i(/) = min 


kii, a) -h a /?(/, y, a) Vn (y) 


, i &S, 


for n >0. Using Banach contraction mapping theorem, it is easy to show that Vn V at 
an exponential rate. 

The policy iteration scheme, on the other hand, starts with an initial guess v : S A 
for an optimal policy and improves upon it iteratively as follows: At nth iteration 
Step 1: Compute V^ : SR defined by 


Vn(i) = E 


, i E S, 


oo 

x; oc^’kiX^, Z^)/Xo = i 

m=Q 

expectation being under the stationary policy Zm = Vn (Xm), m>0. This is done by 
ving the linear equations 

V^d) = k(i, Vn(i)) + O' j, Vn(i))Vn(j), i € S. 

j 

p 2: Find : S Rhy 


Vn+i(i) e argmin 


k(i, •) + a ^ p(i, j, -WnU) , 
J 


i e S. 


3ne can show that the cost strictly decreases at each iterate as long as 14 is suboptimal, 
:uring convergence of 14(0 to the optimal pair. 

We now derive the appropriate ‘simulation-based’ version of this. There are some key 
ferences between them which need to be underscored. The first, of course, is that we 
ilace each summation involving p(i, a) by a simulated transition as per that prob- 
lity vector. In order for this to work, the algorithm should do some averaging. This 
insured by using an incremental version which makes only a small change in current 
rates at each step, weighted by a stochastic approximation - like decreasing step-size, 
iondly, we operate with stationary randomised policies rather than stationary policies 
that simple update equations for the probability vectors therein can be written. Finally, 
linear system of step 1 is replaced by an iterative scheme for its solution before incor- 
rating it into the simulation based scheme. This scheme is a ‘stationary value iteration’ 
en by 


V"+\i) = k(i, D„(0) + ctJ2p(i’ J- OC(<), i e 5, 

j 

m > 0. This forms the ‘inner loop’ of the algorithm, wherein m is being updated for 
rh fixed n. The equilibrium value Vn(-) to which will converge is then passed 

to the outer loop for updating the policy. The crux of the algorithm we propose is to 
lieve this two-tier structure by using two different time scales. 

The ‘centralized’ version of our variant of the actor-critic algorithm is as follows. Let 
'n)}, {Z>(n)} be decreasing sequences in (0,1) satisfying 

^a(n) = ^Z7(n) = oo < oo, a(n) = o(b{n)). 

n n n n 

L <20 € A and let P denote the projection of an r-vector onto the simplex D = 
:i,..., \ Xi > 0, Vi, ^ 1}. Let 7r(i) denote the r-vector [7r(i, ai), ..., 7r(i, a,-)] 

t ea denote the r-vector whose components are indexed by elements of A \ {<2o}> with its 
fnponent indexed a as 1 and all other components equal to 0. The algorithm starts with 
initial pair Vb(-) € and 7ro(/, < 2 ), i € S, a e A\ {ao}, and iterates according to 


Vn+i(i) = (1 - b{n))Vn(i) + bin)[kii, (Pn{i)) + 


I t; o, auu lui cauu i fc o, a fc /i \ tao), 


^n+l(0 = p I ^nii) + a{n) I (Vn(i) - k(i, a') 

\ \a'eA\{ao} 

-CiVninnii, a')))7Tn{i, a')ea' + 

TTfi+lO', «o) = 1 ~ Y 

a^aQ 

for n > 0, with (p(i) = iZnii, ■) € ViA) and k{i, cp) = 7r(i, a)k(i, a) for cp e 'P(A). 

Furthermore, = [^n(l),. •., ^n{s)], Vn = [[Vn(U «)]] are resp. S^, 5''^^^-valued ran¬ 
dom variables conditionally independent of each other given f,-, ?];, i < n, with the 
corresponding conditional distributions equal to 

5 

n •, (PnU)) and Yl n •’ 

7=1 jeSaeA 

respectively. The reader may verify that this is in confirmation with our verbal description 
earlier. A small modification, however, is warranted. If any 7 r„(z, •) is on a face of the 
simplex V{A), it will remain there thereafter. To avert this, a small diminishing noise 
0'(n), with a Lebesgue continuous law, is added to push it away from the stable manifold 
of the unstable equilibrium points on the face of the simplex. 

The distributed, asynchronous version is more complicated and needs additional notation 
and assumptions. To start with, let /i, I 2 be sets of subsets of S', 5” x A \ {no) resp. that 
together cover resp. S,S x A\ Let {y„}, {Z„} be resp. h h- valued processes 
with the interpretation: Yn is the set of z e. S such that (z) gets updated at time n and Zn 
is the set of (z, a) e S x A \ {no} such that Tinii, a) gets updated at time n.We impose on 
these processes the condition: There exists a deterministic A > 0 such that with probability 
one. 



1 n-i 

- E e S'™! > A, i 6 s, 

m=0 

1 «-l 

■ E “) e Zm] >A,ieS,a€A. 

m=Q 

This ensures that the distributed asynchronous version updates all components comparably 
often in a precise sense. (See Borkar 1994 for a graph theoretic sufficient condition for the 
above to hold.) 

Secondly, we introduce delays t„(z, j), fnii,a, j), f(z, a) taking values in [0, 1, , 

min(n, AT)], N >0, with the assumption: r„(z, z) = 0 Vz, n. The idea is: Each component 
of the iteration is updated by a fixed processor, which receives the updates from other 
processors with random but bounded interprocessor communication delays. Thus the pro¬ 
cessor updating V'n(z) receives at time n, yn-xnii,j)ij) and not Vnij). Similarly, the pro¬ 
cessor updating 7r„(z, a) receives at time n, and not Finally, the 
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:essor updating y„(/) at time n receives ti) and not 7Tn{i, a). Assume that 

^ j), j). %i(i, ^)} are independent of hm m < n for each n. The latter 

defined as before, except that the conditional law of given r]m, m < n gets 
aced by YljeS (pnU)) where 
(pnd) = •), i € 5. 

tiould be remarked that the boundedness condition on delays can be replaced by a mild 
iitional moment bound as in Borkar 1994 at the expense of additional technicalities.) 
inally, introduce for n > 0, 

n 

v\{i,n)= X] /{/ e Ym}, i e S, 

m=0 

n 

V 2 ii, a, n) = G Z,„}, i e S, a e A. 

m=0 

e that the processor updating Vn(i) (resp. itnd, a)) knows v\ (i, n) (resp. V 2 ii, a, n)) at 
; n (that being the number of updates he has performed till then), even when he does 
know n, i.e., the ‘global clock’. 

he distributed, asynchronous version of the algorithm then is: for / € 5, a e A\ {^o}, 
Vn+\(i) = Vn(i) + b(v\(i, n))[k(i, (pn{i)) 

A-aVn-r„m„i^i))iUi)) - Vn(i)]I{i e Y„], 

ir„+i(0 = P I ^n(0 + a{v 2 {i,a,n))((Vn(i) - k(i,a) 

V a'eA\{c!o} 

a yn—x{i,a,r]„ (i,a)) din di d> ^) 


+ (p(n))I[d,a) e ZnjCa' 

^n+\ditlQ) = l ^ ^ TT^+l (l, £i), 
a^ao 

0 and [(p(n)} is a random sequence converging to zero. The role of this sequence is 
e as that of (p(n) in synchronous algorithm. For this algorithm, we shall impose the 
owing additional restrictions on {a{n)], [b{n)Y Let [c{n)] denote {a(n)} or {b(n)}. 
n 

There exists r e (0, 1) such that < co- 

For a: € (0, 1), sup„ c([xn])/c{n) < oo, where [• • •] stands for ‘the integer part of 

For X € (0, 1) and A(n) = Em=o H[yn\)/A{n) 1 uniformly in y e [x, 1]. 

examples of {c(n)} satisfying the above are; 1/n, \/n ln(«), In(rt )/n etc., with modifi- 
on for n =0,1 where needed. 

Ve shall analyse these algorithms in § 4 after a brief review of some relevant topics in 
:hastic aonroximation in the next section. 



This section briefly recalls some recent results in stochastic approximation algorithms 
needed for our work. The stochastic approximation algorithm in its simplest form is the 
fif-dimensional iteration 


X(n+ 1) = X{n) + a{n)(h{X(n)) M(n)), n > 0, 


(1) 


where {a(n)] is as before and {M(n)} is a sequence of integrable random variables 
satisfying 

E[M{n)/X{m), m < n, M(m), in < n]=0, n > 0. (2) 

The convergence of this algorithm to a desired limit is usually established by first estab¬ 
lishing sepai*ately that with probability one, 

sup j X(n) |< oo, ^^a{n)M{n) < oo. (3) 

n 

Given these, one way to analyse its asymptotic behaviour is by showing that it asymptoti¬ 
cally tracks the ordinary differential equation (ODE) given by 

x{t) = h{x{t)). 

Assume h to be Lipschitz with linear growth, ensuring that this ODE has a unique solution 
for any initial condition, defined for all t > O.Suppose this ODE has a globally asymp¬ 
totically stable attractor J. Then by converse Liapunov theorem (see, e.g., Yoshizawa 
1966) there exists a continuously differentiable V :R^ —^ satisfying V(x) oo 

as ||;r|| -» oo and VV'(a:) • h{x) < 0 for x ^ J. Now, given T, 8 > 0, call a bounded 
measurable function y (-): a. (T, 5)-perturbation of this ODE if there exist 

0 = To < Ti < 72 < • • • Tj t 00 , such that Tj+\ > 7}• + T and there exist solutions (•) 
of the ODE on each interval [Tj, Tj] such that 

sup iWit)--y(t)\\ < 8. 

Then one has; 

Lemma 1 (Hirsch 1989). For every e > 0, T > 0, there exists a 5o > 0 such that for any 
8 € (0, (5o), every (T,8)-perturbation of the ODE converges to the e-neighbourhood of J. 


\ 

( 

i 

I 



The idea of the proof is; The Liapunov function V must decrease by a minimum positive 
quantity along every x-i (•) that does not intersect the ^-neighbourhood of J and therefore 
along the corresponding patch of y (•). This can happen for at most finitely many consecutive ; 

J’s, so y (•) must eventually intersect this neighbourhood. Analogous considerations show :> 

that it cannot move away too much once it has done so. (See the appendix of Borkar 1996 |i 

for details.) ! 

The convergence analysis of the stochastic approximation algorithm now hinges on the 
following ‘time-scaling’ argument: Let r(0) = 0, t(n) = and pick m(n) 

according to: m(0) = 0, m(n) = min{f(/:) 1 t(k) > t{m{n)) -f T]. Set 7}- = t(jnU))^ 
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0. Define t > 0, by: y(t(m)) = X{m), m > 0, with linear interpolation 

r(m), t(m + 1)], m > 0. Let be the solution of the ODE on [Tj, 7}h-i] with 
"y) = y{Tj), j > 0. Then 3 ^(-) on [Tj , Tj+\] may be viewed as an Euler approximation 
e ODE with a nonuniform but decreasing (with j) step-size, modulo an error term due 
i^(n)} that also becomes asymptotically negligible thanks to (3). The above lemma 
applies ‘eventually’ (i.e., for sufficiently large y) for each 6 > 0, ensuring Z (; 2 ) -» J 
probability one. 

ow consider a ‘two time-scale’ variant of the basic algorithm; 

X(n -f- 1) = X{n) -t- ain)(F(X(n), Y(n)) + M{n + 1)), 

F (22 -h 1) = Y{n) -f b{n){G{X{n), Y{n)) -b M'{n + 1)), 

re F, G are Lipschitz with linear growth and M(7i), M'{n) are integrable random 
ibles uncorrelated with the past (i.e., satisfying (2)) and a{n) = o{b{n)). Suppose 
probability one, the following hold: 

sup ( X(n) 1< oo, 'Y^a{n)M{n) < oo, 

n 

sup 1 Y(n) 1< oo, \^a(n)M'(n) < oo. 

" n 

Iso suppose that for each x, the ODE, 

y(t) = G(x,y(t)), (4) 

a unique globally asymptotically stable equilibrium point X(x) where A,(-) is Lipschitz 
inuous and the ODE, 

x(t) = F(x{t), X(x(t))), (5) 

a unique globally asymptotically stable attractor J. 

orem 1. With probability one, (X(n), Y (n)) {(x, X(x)) | x E 7}. 

proof can be found in Borkar (1996) for the case when 7 is a singleton and extends 
ly to the more general case. The idea is to mimic the above time scaling argument first 
i b(n) (i.e., t(n) = ^(0), so that the interpolated trajectories track the ODE 

x{t) = 0, y{t) = G(x(t), y{t)), 

then again with a(n) (i.e., t(n) = so that the interpolated {Xin)} tracks 

Lemma 1 is used in each case in the obvious manner. Thus the fast component {^(n)} 

I the slow component {Z(n)} as quasi-static, while the slow component sees the fast 
iponent as ‘essentially equilibrated’. 

he distributed, asynchronous version of the algorithm (1) is as follows: If Z/ (n) is the 
component of the vector Z(n), it is updated as per 

Xj{n -i- 1) = Zi(72) + a{v{i, n)){h(Xi(n — Xn{i, 1)), • • •, Xd{n — t„(/, d))) 
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For this case, the rather Intricate analysis of Borkar (1994) shows that a suitably inter¬ 
polated version of {X (n)}, if bounded, tracks the ODE 

i(0 = \h(x(t)). 
d 

The scalar 1/i/ up front amounts to linear time scaling that does not alter the qualitative 
behaviour. Thus X{n) -> J with probability one as before. (It should be remarked that 
the algorithm considered in Borkar 1994 is slightly more restrictive than (1) or (5), but the 
same arguments go through nevertheless.) 

4. Convergence analysis 

In this section we shall adapt the ideas of the preceding section for the convergence analysis 
of the actor-critic algorithm proposed in § 2. As already mentioned earlier, only a sketch 
of the proofs will be given. 

Define the map F;P(A)^ Xby F(-, •) = [Fi (•,•),•••, ^s(*, where 
Fiiz, x) = Y.a n) + a Y.j Ha a)p{i, a)xj for x = [xi,..., x^]^ and 

z = [[z{i, <3)]], i e S, a e A with z(i, •) e V(A) Vi. Also define the following norms on 
R^ 

lkllp= I ^ , p€(l,0O), 

llxiloo = inax 1 Xi 1 ■ 
i 

Then it is easily verified that for each fixed z, the following contraction condition holds: 

11 F(z, x) - F(z,y) Iloo < allx-ylloo , x,yeR^. 

The first iteration of the centralised algorithm can now be written as 

V „+1 = + b{n){F{iZn, Vn) - Vn) + b{n)M{n), n > 0 (6) 

for suitably defined M{n) which will satisfy (2) (with X{m) = V^). 

Lemma 2. With probability one, the iterates of the centralized or asynchronous algorithm 
remain bounded. 

Proof. We consider the asynchronous case, as the centralized case is a special case thereof. 
The iterates of {7r„} are bounded anyway because of the projection P onto a bounded set. 
That Vn remain bounded with probability one follows exactly as in theorem 1, pp. 190—191, 
of Tsitsiklis (1994). (In Tsitsildis 1994, the algorithm slightly differs from (6) insofar as 
P(^n, Vn) would be replaced by F(Vn) for some F satisfying (say) l|F(x) — F(y)lloo 
~ y Iloo- Nevertheless, exactly the same proof applies.) 

In turn, this implies that {M(n)}, which is defined in terms of Vn, remains bounded with 
probability one. A direct calculation shows that in addition 

f(n) = E[M{nf /Vm,7Tm, Tm(i, j), a, j), ^m(i, a), Ym, Zm, 
fn <n,i,j e S, a € A] 


VI □ 
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lain bounded with probability one. Since Yin b(n)^ < oo, we then have Yn b(n)^\lr(n)^ 
DO with probability one. The property (2) renders the process 

n 

^^b{i)M(i), n >0, 
i=0 

lartingale and the foregoing then ensures that its quadratic variation process (see Neveu 
75, for a definition) is convergent with probability one. Proposition VII-3-(c), pp. 149- 
3 of Neveu (1975) then ensures that 

oo 

^b(i)M(i) 

convergent, and hence finite with probability one. This, in conjunction with lemma 2 
Dve, completes our verification of (3) for iteration (6). 

These considerations will enable us to use the results of the preceding section. Prior 
doing so, we look at the expected ODE limits. The first one to be considered is the 
iinterpart of (4) in our set-up, which is 

i(0 = F(7r, x(t)) — x(t) (7) 

■ F as above and a fixed ir e V{Ay . 

mma 3. Equation (7) has a unique globally asymptotically stable equilibrium point Vyt 
>en by 

oo 

%(i) = E 5^a'"fc(X„,.Z„)/Xo = i , 

m=0 

? expectation being with respect to the stationary randomised policy n. 

oof. Standard contraction mapping arguments show that ■) has a unique fixed point 
d usual dynamic programming type arguments show that defined above must be it. 
)w for p € (1, oo), explicit differentiation leads to 

- v„\\p = \\x(^t) - VA'p'^’i-W) - VA\p 

1 -S _ 

+ - y\(xi(t) - VAi)Y-'(Fi{7t,x(t)) - Fi(n, V,r)) 

^ 1 = 1 

< -\\x{t) - VjtWp + \\Fin,x{t)) - Fin, Vyr)\\p 

lere the inequality follows from an application of Holder’s inequality. Let p oo. 
dng continuity of p -> ||xllp on [1, oo] and the contraction property of Fin, •) under 
c II • II CO-norm, we have 

4:\\xit) - v^lloo < -d -a)||x(0 - l4!loo , 


i-NULc LiiciL Vjj; li, LiiiuiiuLCiiz,cu uj ulc iiiical bysLCUi ui cquuuuiijs r , Vjt) — vjir, 

it depends smoothly on the coefficients thereof, hence on ti. In particular, the map is 
Lipschitz on V{AY. ^ L, 

Define a vector field G(-) on V{AY by;G(-) = [[G/^i(-)]] for / e 5, a e A, where ' p 

Giain) = 7T{i,a) |^Vjr(z) -k{i,a) - (xY^ p{ij, a)Vjt(j) 

The second ODE (the counterpart of (5)) that we consider is 

TCit) = G{7t{t)). ( 8 ) 

It is easy to verify that this ODE will always remain in V{AY if 7 r( 0 ) is. Furthermore, 
it will converge to equilibrium points, the latter being the set of those n e ViAY' that 
satisfy; for each (i, a), either jT{i, a) = 0 or k{i, a) A- a p{i, j, a)Vyt{j) = Vjr (0- ^ 
little thought shows that the stable equilibrium points are those for which, for each {i, a), 
either 7 r(j, cz) = 0 or 

Vjr{i) = mm ^{i, + a ^ p(z, 7 , a)Vjt{j) 

Since not all 7r(z, a) can be zero for a given /, the above equality holds for all z.Hence tlie 
only stable equilibrium points are the tt for which 14 equals V. These are precisely the 
optimal stationary randomised policies. 

Consider the centralized algorithm. 

Theorem 2. With probability one, the centralized algorithm converges to {{V,n) \ n is 
an optimal stationary randomised policy}. 

Proof. We can adopt the ‘two-time’ scale results of the preceding section with one differ¬ 
ence. The difference here is the presence of the projection operator P in the algorithm. 

This, however, is a standard feature of many stochastic approximation algorithms and the 
theory thereof is well-understood; see, e.g., Kushner & Clark (1978). As in the theorem 
5.3.1, p. 191, of Kushner & Clark, we can conclude that the limiting ODE tracked by a 
suitably interpolated version of (and hence by {jTn] itself) is given by the 
dimensional ODE 

jT(t) = P{G{7r{t))), 




where GjvAtt) = G/o([7r[l, ffil b): ■ ■ ■ :n{s, a): 
1 7 € S, and P{h{-)), for a vector field /z(-) on V{AY is defined by 


P{h{y)) = 


lim 

0 < 3->0 


P{y-A h{y)8) - P{y) 
8 


If the limit is non-unique, one considers the set of all limit points and treats the above as 
a differential inclusion rather than an ODE. Fortunately for us, P{G{tc)) is well-defined 
and in fact equals G(7r), as can be easily verified. Thus the limiting ODE is 
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ft — G(jT{t)). 

now augment TT(7) by adjoining s additional corriponentsirl/, flo) = 1—^ ^ 

Denote the enlarged vector by 7r(t) again by abuse of notation. It is easy to verify that 
s redefined, jr(-) satisfies (8). The convergence for the two time scale stochastic algo- 
ims can now be invoked to claim that with probability one, converges to the set of 
lilibrium points of (8). The final step is to show that it will in fact converge to the set of 
ale equilibrium points of (8) with probability one (Pemantle (1990), see also Brandiere 
Duflo (1996)). To show this, all we need is that the set of points that get attracted to an 
:table equilibrium point have zero probability. It is easy to see that Lebesgue measure of 
stable manifold of an unstable equilibrium point is zero. Thus it is enough if the noise 
t) has positive density with respect to Lebesgue measure in a shrinking neighbourhood 
the origin. The claim follows. □ 

eorem 3. The claims of theorem 2 also hold for the decentralized algorithm under 
litional hypotheses stipulated in § 3. 

Fhis follows simply by combining the foregoing with the scheme of Borkar (1994). This 
npletes our convergence analysis of the algorithm. 

Approximation issues 

mentioned in the introduction, in many applications the ‘curse of dimensionality’ forces 
; to interface the above and other simulation-based algorithms with an approximation 
leme. Here we sketch a scheme based on state aggregation, adapted from Tsitsiklis 
Van Roy (1994) where it is proposed as a ‘look-up table scheme’ in the context of 
learning. 

Fhe idea is to partition the state space S into disjoint nonempty subsets , 52 , , S/n , 
:h identified as an aggregated state. For each j, let fj{-) be a probability vector on Sj 
;h fj{i) > 0, V/ e Sj. The centralized version of the algorithm nins as follows: At 
:h 77, generate for each j, 1 < j <m, random variables (X'l, , 7?“^^ a e A) with law 

P(X’J = ij, = kj, = Xaj, aeAJ = h2,..., m) 

m 

= n ^nU)) n 

jz=] a£A 

lependently of the past, where cpn {j) = rcn (y, •)• Define rjj^ e (1,2,..., m} by fija = k 
fj^ e Sic. The iteration at time n is: for each j, 1 < j <m, 

V „+1 U) = (1 - b(n))V„U) + brnhiX], <PnU)) + 

^Ii+i(j) = f’(^n(j) + a(n)f VnU) - k{Xj, a) 

\ \aeA\{flo) 


^n+lU.ao) = \ - ^ 7r„+i(;,a), 
a^a[) 

where P is the projection onto the appropriate r-dimensional simplex. 

An analysis similar to the foregoing shows that with probability one, these iterates 
converge to (!/*(•), 7r*(-, •)), where for j 6(1,2,..., m}, 

m 

V*(j) = mm a) + a'Y2V(l) ^ p(i, I, a)] 

i /=1 xgSi 

and support .)) c argmin of the rhs above. For error estimates for such schemes, 

see Tsitsiklis & Van Roy (1994) 

A distributed version of this approximate scheme along the lines of the foregoing can 
also be analysed accordingly. 

6. Simulation experiments 

To demonstrate the convergence of the proposed algorithm we took two examples. The 
MDPs of tliese two examples are similar to those used to model admission control into 
an M/M/1 queue and routing to parallel M/M/1 queues respectively (Walrand 1988). In 
both these examples, in any iteration, only one component of V and one component of 
lu are updated. The probability of update of a component in an iteration is equal for all 
components and is independent of everything else. Since 7T,i(i, a) cannot change once it 
becomes zero, a small noise is added whenever it is sufficiently close to zero. The determin¬ 
istic value iteration was used to find out optimal value function and policy. The algorithm 
was started with value function being identical to zero, and with all the randomised policy 
probabilities equal. The examples are described below.The notation used here is the one 
introduced in § 2. Interprocessor communication is assumed to be instantaneous (i.e., no 
delays). 

6.1 Example 1 


5 = {0,1,...,A}, 

A = [0,1}, 

0, if j < max (7 — 1, 0) or y > / 4- 1, 

X, if j = min(f + \ , N) and a = 0, 

or j = i and a = 1, 

1 — A., if y = max(i — 1, 0), 



k(i, a) = 


U, ifa = 0, 

[/ -I- c, if a = 1. 


For the purpose of simulation we took: 


A = 10, a(n) = ({[jLj-hl}//i)3 

A = 0.65, 

c=10, b{n) = {l-^\ + l}/n, 
a = 0.99. 
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The optimal value function and policy were: 

1/=^ [669.0261 679.4227 694.0252 711.0231 730.3492 751.9383 
775.7268 801.6532 820.9869 831.7235 836.4438], 

7r(-,0) = [l 1 0 0 0 0 0 0 1 1 1], 

jtO, 1) = 1 — Tc(i, 0) i e. S. 

After 2 X 10^’ iterations the result of our algorithm was 

y = [663.2563 673.3967 687.6990 708.0571 725.9087 748.0945 
775.7268 796.7382 820.2483 830.5291 835.0053], 
:r(-,0) = [l 0.9986 0 0 0 0 0 0 0.9871 1 0.9955], 

7r(/, 1) = ] — 7r(/, 0)/ G S. 

6.2 Example 2 

S = {(iJ). Q<i <N, 0<j <N}, 

A = {0,1}, 

X, ifk = min(/ + 1, A^) and j — 1 

and a — 0 

= i and I — min(y + 1, A/) 
and fl = 1, 

/X], ifk = max(i — 1,0) and / = j, 

1 — X — fj,{, ifk = i and I = max (7 - 1,0), 

0, otherwise, 

j),a) = cii + cij. 

For the purpose of simulation we took: 

A=10, ^(/2) = ({LTgoJ + l)/'^)^ 

1=0.65, A(n) = 0.1((LigoJ + ll/'^) 
ci= 10, C2 = 15, 
q!=0.99, 1 = 0.5, 

/xi = 0.3, /X 2 = 0.2. 

The optimal value function and policy were: 

> [3.3602 3.4448 3.5978 3.8137 4.0899 

3.4281 3.5179 3.6710 3.8863 4.1618 

, 3 3.5277 3.6183 3.7696 3.9833 4.2578 

LLV 0,7)11 - 10 X 3 g3gQ ^ jiji 3.8643 4.0761 4.3496 

- 3.7172 3.7922'3.9370 4.1474 4.4201 

3.7590 3.8328 3.9766 4.1864 4.4588 


4.2539 

4.3018 

4.3875 

4.4937 

4.6036 

4.7032 
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iterations the result of our alg 

□rithm was 







'3.3715 

3.4610 

3.6047 

3.8221 

4.1100 

4.2684 



3.4399 

3.5317 

3.6813 

3.9045 

4.1760 

4.3166 

.. . .. 1 . 


3.5316 

3.6327 

3.7757 

3.9971 

4.2682 

4.3978 

Llv(z,y)Jl = lU''x 

3.6365 

3.7207 

3.8772 

4.0869 

4.3490 

4.4987 



3.7219 

3.8000 

3.9480 

4.1499 

4.4253 

4.6193 



3.7650 

3.8431 

3.9849 

4.1919 

4.4675 

4.7109 


r 

l.OOOO 

0.9861 

l.OOOO 

1.0000 

0.9900 

o.oooo" 


0.0000 

1.0000 

0.9900 

1.0000 

0.9844 

0.0000 


0.0000 

0.9997 

0.9910 

0.9977 

0.9994 

0.0000 

[[^((/, ;),0)j] = 

0.0000 

1.0000 

1.0000 

1.0000 

0.9977 

0.0000 


0.9975 

0.9998 

0.9884 

0.9999 

0.9999 

0.0000 



1.0000 

1.0000 

1.0000 

0.9998 

1.0000 

0.0000 


Note that the convergence is slow, the price we pay for not using prior information about 
the ti'ansition mechanism as well as for asynchronism. 


7. Further directions 

We conclude by listing some promising research directions. 


Average cost control: In many problems such as control of communication networks 
where quasi-equilibrium behaviour is desired, long-run average cost is preferred to dis¬ 
counted cost. It would be useful to extend the algorithm to the average cost setup, possibly 
using the techniques developed in Abounadi et al (1996a,b) for Q-learning. 


Approximation based on compact representations: In addition to the approximation 
scheme proposed in § 5 above, Tsitsiklis & Van Roy (1996) also consider another scheme 
based on compact representations in the context of Q-leaming. The idea is to directly ap¬ 
proximate y (•) by a function belonging to a prescribed parameterized family and update 
the parameter in question rather than updating V (•) directly. In the actor-critic scheme, 
however, there is an additional iteration for 7r«(-, •)- One can conceivably write a function 
approximation scheme for 7r(-, ■) using a parameterized family (such as a neural network 
- see, e.g., Santharam & Sastry 1997) and update the probabilities recursively. These 
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edhack implementations: The algorithm we consider above is ‘off-line’, i.e., is based 
a simulation run rather than an actual system being controlled in real time. One can 
avert it into an on-line (or feedback) adaptive control algorithm for a controlled Markov 
ain Xtu n > 0, by setting Yn — V/? and letting •) be the actual randomised 

atrol law being implemented. A natural question then is whether the scheme is asymptot- 
lly optimal. (For the appropriate concept of ‘asymptotically optimal’ in the discounted 
mework, see Schal 1987). Recall that the convergence of the algorithm to desired limits 
luires that all state-action pairs be tried sufficiently often. For states, this may happen 
tomatically if suitable irreducibility conditions are met, even in the feedback case. But 
;he lin (•, •) converge rapidly (to the desired limit or otherwise) all the state-action pairs 
ly not get updated frequently enough. A simple way out of this conflict is to modify the 
;dback law to a convex combination of ^„ (•) and the uniform distribution on A so as to 
sure a minimum probability e > 0 of each a ^ A being picked. For 6 > 0 sufficiently 
lall, the scheme will be nearly optimal within a prescribed tolerance. However, too small 
e may slow down convergence. Thus there is a trade-off involved. A potentially promis- 
\ scheme is to start with a large 6 e (0, 1/rl (to ensure all state-action pairs being tried 
quently) and then reduce it ‘slowly’ enough to ensure optimality. (Recall the simulated 
nealing algorithm for global optimization.) It is, however, a nontrivial task to capture 
5 optimal rate of decrease of e in a precise manner. These issues need further study. One 
ould also add that presence of interprocessor communication delays causes nontrivial 
mplications in the feedback case. 


te of convergence: We have not provided any theoretical analysis of convergence 
e. Since a stochastic approximation algorithm eventually tracks the associated ODE 
a precise sense outlined in § 3, the convergence of its interpolated version to a given 
ighbourhood of the asymptotically stable limit of the ODE (assuming one exists) will 
)sely mimic that of the ODE itself. The rate of the latter could be gauged from the 
ipunov function approach. One must, however, invert the time-scaling n t(n) to get 
; convergence behaviour of the original algorithm. 

Even this may be worthless if ‘eventually’ is in too distant a future. There are other 
oblems too; The ODE captures the averaging effect of the algorithm akin to the law of 
ge numbers. But there can be fluctuations around the average behaviour of the ‘central 
lit theorem’ variety. For a two time scale algorithm, the time scales should be sepa- 
;ed enough so that the slow one does not get swamped by the fluctuations of the fast 
e. 

A related issue is the generally high variance of the stochastic approximation algorithms. 
1 additional averaging can reduce this, see, e.g., Polyak (1990). This and other issues 
rtaining to improving the performance of the algorithm need cai'eful study. To cut a long 
)ry short, the newly opened field of simulation-based algorithms for control offers many 
allenges both in theory and practice. 


le work of the first author was supported by a Homi Bhabha Fellowship. 
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An optima! ftiel-mjection policy for performance 
enhancement In internal combustion engines 
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Abstract. A fuel-injection internal combustion engine system is considered, 
wherein supply of fuel to the engine is controlled with the twin purposes of 
maximizing power output and minimizing fuel wastage. The system is modelled 
as a controlled Mai'kov chain and a feed-back optimal control policy is obtained 
for the long-run average reward optimality criterion using Markov decision 
theory. 

Keywords. Dynamic programming/optimal control; infinite state Markov 
chain; long run average reward; optimal policy; fuel injection control in IC 
engines. 


1. Introduction 

Internal combustion (IC) engines are widely used in a variety of applications and almost 
most importantly in automobiles. A large number of such machines use the two-sb*oke 
petrol engines. One of the gravest problems with two-stroke petrol engines is the unreliable 
combustion, which leads to cycles (a cycle comprises one revolution of the crank and is 
equivalent to two strokes of the piston) that do not produce any power and also release 
unignited fuel which goes to the exhaust and gets wasted. Twenty-five percent of the 
cycles being ‘powerless’ is quite common even in the latest engines. Until recently, fuel 
was supplied to the system using a fuel-air mixer (technically known as carburettor) almost 
^ in all cases. This carburettor being in line with the air-flow of the system would supply the 
fuel-air mixture continuously to the combustion chamber system. However, in recent times, 
there is a shift to ‘direct petrol injection’ systems, where air supply is done independently 
and the fuel is directly injected into the system, once every cycle. This has rendered the 
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system amenable to fuel injection control, since one could decide whether the fuel was to 
be injected or not. We refer the reader to Moskwa & Hedrics (1992) and Jones et al (1995), 
and references therein, for the latest trends in this direction. 

In the following, we propose a stochastic model of the system, and also propose strate¬ 
gies for control. The key idea is to predict on-line, the occurrences of ‘powerless’ cycles 
stochastically, and to cut fuel at those random instants, thus minimizing fuel wastage 
without compromising on the power output of the engine. 

2. The problem 

We consider an engine with fuel injection operating in a steady state. The two binary 
variables that are of interest to us every fth cycle are jc/ and y;, which ai'e defined as 
follows. Xi — 0, if fuel is not injected, and Xi = I, if fuel is injected. Similarly, yi = 0, if 
combustion does not take place, and yj = 1, if combustion does take place. We note that 
if we define Zi = x,- — y,-, then Zi is also a binaiy variable such that zt = 0, if x, = y; = 1 
or Xi = yi = 0, and Zi = 1, if x/ = 1 and y/ = 0. It is evident that the possibility 
X,' = 0 and y, = 1 is impossible, since there can be no combustion without any fuel being 
injected. Clearly, if zi = 0, tlien there is no wastage of fuel, and if Zi = 1, then fuel in that 
cycle is wasted. Hence, Yli Zi is a ‘measure’ of the total fuel wastage. Also note that J2i yi 
is a ‘measure’ of the total power output. If we further denote by non-negative constants 
B and A, the cost of fuel per cycle and the profit or gain from power output per cycle 
respectively, then the problem reduces to the combined task of maximizing Yli 
minimizing Yli Bzi. Accordingly, we choose a cumulative reward function C, defined by 
C — A yi — B Y^i Zi. Here, we also note that x/ and y/ being random variables, we 
cannot directly maximize C, though we could maximize the expectation of C. Noting that 
yi are random variables, we also make the following assumptions. 

Assumptions A. (1) {y,} is such that whenever y,- = 0, (2) and (3) below (of (A)) hold 
from i onward and this does not depend on the value of i . 

(2) We denote by pi, p 2 ,... the probabilities Prob{y;_t-i = 1 jy; = 0, x^+i = 1} = pi, 
and for A: > 1, 

Prob{yj+fc = liy,-+fc_i = 1,..., yz+j = 1, y; = 0, Xi+k = 1} = Pk, 

where the vertical bar denotes conditional probability. 

(3) 1 > Pi > p 2 > P3 > • • • etc. 


We now explain the physical significance of A(l)-A(3). 

A(l) implies that whenever there is a ‘powerless’ cycle, the whole system begins afresh 
and what happened before such a cycle does not affect what happens afterwards. This is 
quite reasonable, since otherwise one will have to assume either an infinite memory system 
or an arbitrarily assigned finite memory system. 

A(2) means that combustion history probabilistically determines its future. One should 
note here that this structure must be corroborated by ‘experimental observation’. 

A(3) means that as the sequence of fuel utilization (fuel-non-wastage) becomes longer, 
the chances of encountering fuel wastage increase. This is possibly the outcome of the 
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pessimism that the longer things go ‘right’, the chances that they go ‘wrong’ increase! The 
restriction p\ < 1 is mild and merely allows for consecutive ‘powerless’ cycles. However 
p\ can be as close to 1 as we want. 

In what follows, we shall formulate this problem as a controlled Markov chain and 
apply Markov decision theory to obtain the optimal closed loop feedback control 
policy. 

We shall now describe the manner in which this controlled Markov chain is constructed, 
in lieu of (A). We use {0, 1} as the control set with xi as the control at the zth step. Hence, 
at any stage we pick xi = 1 as the control if we decide to send fuel and choose xi = 0 
otherwise. 

Let {Xn} represent the controlled Markov chain on the state space of nonnegative integers 
with {0, 1} above as the control set. Let Xq = j represent the initial state, for some 
integer j. Then for any integer i > LX/ = j + i if y\ = yi = • • • = yi = 
else if yi^ =: 0 for some integer k, X^ = 0 as well. Hence for i < n, if X„ = /, then 
Xn-i = 0. Therefore any state i in the state space of the Markov chain corresponds to the 
zth consecutive 1 in the y„ sequence. The moment y„ = 0 for some n, the Markov chain 
jumps back to state 0. Let p(i, j, a) represent the transition probability of moving from 
state i to state j when action or control /z € {0, 1} is chosen. The transition probabilities 
then are: 


p(i,i i-1,1) = Pi+l, p(i,0, 1) = 1 - Pi+i, Vz > 0, 

and p{i, 0,0) = 1, Vz > 0. Here pi are as in (A). 

Let R(i, a) represent the one-step reward random variables when the Markov chain is in 
state z and action a is chosen. In lieu of the cumulative reward function C defined earlier, 
we define R(i, a) as follows: 

R(i,l)=A w.p. pi+i, 

= -B w.p. 1-P/+1, 

Vi > 0 and ^(z, 0) = 0 w.p. 1, Vz > 0. Here and everywhere else w.p. stands for ‘with 
probability’. Now let R{i, a) be the one-step expected reward in state z when action a is 
chosen. Then, 

R(i, 1) = A.pi+i - B(1 - pi+\) = {A + B).pi+x - B, Vi > 0, 


and R{i, 0) = 0, Vz > 0. 

A policy is defined as a rule for selecting actions and can in general be randomized. Let 
for any policy n and initial state z. 


07r (0 


lim inf 
«->oo 


Eyr[E]=oR(Xj,aj)/Xo = i 


n + 1 


( 1 ) 


Here Eyt represents the expected value under policy tt, 07r(O represents the average ex¬ 
pected return per unit time when policy tt is employed and initial state is i. We shall say 
that a policy tt* is average reward optimal if (fjt* (0 = max^r Our objective shall 

be to find tt* from the class of all policies. 



Let Vq! represent the value function for the a;-discounted reward criterion defined as: 

r 00 1 


Va{i) = max Ej^ 

Tt 


^c<"R(X„,o„)/Xo = i 


( 2 ) 


L «=o J 

where the discount factor a. € (0,1). The dynamic programming equation for the a- 
discounted reward tells us (Ross 1983, p. 31) 


Vo:(i) = max |R(i, a) + a ^ p{i, j, a)VaU) 

Since we have only two controls 0 and 1, the above equation in our case becomes 

Va{i) = max{i?(L 1) + 1) + «(! -- Pi+i)Va(0), aVa(O)}, (3) 

for i > 0. In what follows, we shall first show that Va{i) is nonincreasing in i. 


Lemma 1. Under (A), R(i, 1) is nonincreasing in i. 

Proof. R(i 4-1,1) = (A + B).pi+2 - 5 < (A + B).pi+i - B = R(i, 1), the inequality 
following because of A(3). This is true for every i . The claim follows. 

PROPOSITION 1 

Under (A), VaCO w nonincreasing in i. 

Proof Consider the value iteration form of (3) (see for instance, chapter II.3 of Ross 1983 
for value iteration). We have 

V^(i) = max{/?(!, \) + (i + 1) 

+ «(! - K+i)Vr‘(0). «V"-'(0)), (4) 

Vi > 0, n > 1. We shall first show by induction that V” (i) is nonincreasing in i for every 
n, and then we shall show that V^{i) 14 (z) as zi -> co, thereby implying the claim. 

Let V'cJ(z) = R{i, 1). Then, by the previous lemma, V^(i + 1) < V^(i). Now assume 
that (j) decreases in j. So, as in (4), we have for z + 1, 

y^(z + 1) = max{R(z + 1,1) + api+ 2 Va~\i + 2) 

+ a(l-Pi+2)Vr'(0), O!V„"-‘{0)}. (5) 

We need to consider only the first terms in the arguments on the RHS of (4) and (5), since 
the second terms are identical. We have, 

R(i, 1) + api+i V"-' (,• + 1) + a(l - p,+i) (0) 

= R(i, 1) + api+i(V;-'(i + 1) - V"-‘(0)) +<*^"-'(0). 

Now since (j) is nonincreasing in J, we have 

+ 2) < vr'(i + 1) < ■ • • < V;?->(0), 


( 6 ) 
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so, V^~ ^ -f 1) — K” \0) > ^ (i + 2) — 1^" ^ (0). Now, by lemma 1 above, we have 

theLHSof(6) > «(i + I, I)+apj+ 2 (Vi'"‘(i+2) - V"”'(0))+ aV^-'(0) or, > 
+1), Vf > 0. Hence, V”(0 is nonincreasing in i,Wn by induction. Since R(i, 0) = 0, 
Vi, it can be easily seen by (2) and lemma 1 that 1 VQ;(i)l < \Ri0, l)l/(l — a). Now as 
in Ross (1983, p. 36) Va(i) = lim„ V^(i), uniformly in i. Hence Va{i) is nonincreasing 
in i. □ 

Let T = min{n > 0|X„ = 0} represent the first time the Markov chain {X^} hits state 
0 starting from any state. Note that 7 is a random time. Let fa be the of-discount optimal 
policy. Further, let / be the policy that chooses action 1 for all n . Now clearly, Ef^ [T/Xq = 
1] < Ef[T/X q = i]. Further, from (A) it is clear that EflT/Xo = i] < Ef[T/X q = 0], 
Vi. Now under / with Aq = 0, we have: 7 = 1 w.p. 1 — pi, and for y > 1,7 = y w.p. 
piP2...P)-i(l - Pj)- Now by A(3), 

oo 

EflT/Xd = 0] = ^ i.px ■ ■ ■ Pi-iO- - Pi) ■■ 

i = l 
00 

i — l 
oo 

1 


where, in the first equation above, for i = 1, the summand is to be interpreted as 

(1 - Pi)- 

Hence ^/^[7/Xo = i] < oo, for all i and a. So, by theorem V.2.4 of Ross (1983), 
— Vo!(0) is uniformly bounded in i and a, as a result of which by theorem V.2.2 of 
Ross (1983), there exists a bounded function h(i), and a constant g, such that 

g + h(i) = max{R(i, 1) + p{i, i + 1, l)h{i + 1) 

+ p(i,0, l)/z(0), /i(0)}, (7) 

where g = limo;->i(l — a)Va(0). Now, as a result of (7) and theorem V.2.1 of Ross 
(1983), there exists a stationary policy Jt* such that (pn*{l) = maxj,: <^7r(0 = ^ 0, 

and is the one which for each i, prescribes an action that maximizes the right side 
of (7). Again by theorem V.2.2 of Ross (1983), 3{Q!n} of discount factors such that 
an € (0,1), Vn and Q!„ t 1 as n 00 and ha„{i) Hi) which is a bounded 
function, and where ha„(i) = V'o!„(/) — Va„(0). By proposition 1, /iq;„( 0 :< 0, Vn, L 
Hence, h(i) < 0 Vi. Also, ha„(0) = 0, Vn, and hence h(0) = 0. As a result, (7) 
becomes 


g + h(i) = max{R(i, 1 ) 4 - pi-i.i.h(i + 1 ), 0 }. 


( 8 ) 
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or 

(A + B)pi^\ — 5 + pi^ih(i + 1) < 0, 


or 

B 

(9) 


n- , j < -^ 

^ ^ - A + 5 + /z(i + l) 

Let 

i — min{i|(9) holds}. 

(10) 


If the set in (10) is empty, we take i = oo. 

The above is summarized in the following theorem, which is the main result of this 
section. 


Theorem 1. Under (A), there exists a state i < oo given by (10), such that the long 

run average reward optimal policy selects control 0 when the Markov chain is in i. 

4. Discussion of the results 

(1) The function /z(0 is monotonically nonincreasing in i. This is because, by proposition 

K!„(j + 1) :< ^a„(0^ Vi, and where {a;„} is the subsequence of discount factors 
0 ! € (0,1) defined earlier such that ha„(i) -> h(i) Vi. Then, (0) - Va„ 4- 1) > 
Va„(0) — Va„(i). Taking limits on both sides as n oo, we obtain /z(i + 1) < h(i) 
Vi. 

(2) As a result of above, note that the RHS of (9) increases with i, whereas the LHS of (9) 
decreases with i and so, we would only be interested in the first i at which the RHS 
overshoots the LHS. 

(3) R(0, 1) > 0 corresponds to p\ > B/(A 4- B). For this case, consider a policy / 
(say) which when starting in state i selects action 1 if R{i, 1) > 0, till the first k such 
that R(i + k, 1) < 0 (which one expects by lemma 1) and selects action 0 after that, 
with the same thing repeated from state 0 subsequently. If vj (i) is the cumulative 
of-discounted reward function corresponding to this policy when starting in state /, 
then clearly v/(i) > 0 and so Va{i) > y/(z) > 0. Further if R(0, 1) < 0, then 
^(* 5 1) < 0, Vi, and hence it would be optimal to have a pohcy which selects action 0 
for ever, starting in any state. It can be formally argued in this case that Va (0 = 0, Vi. 
Hence, we conclude here that irrespective of J?(0,1) being positive or negative, Va (0 
is always nonnegative for every i. 

(4) From (9), define {p„„(i)} as: 

PaAi) = B/(A + 5 + Va„{i) - y„„(0)) 

where {«„} is defined as before. To check that it is a probability, we need to check that 
0 < PaAO < l,Vi.Nowforpa„(i) < l,weneedtocheckif A + 14„(i)-l/a^(0) > 0. 
From (3), note that Va„ii) > Q!n.VQ,^(0). Further as in point (3) above, Va^ (0) > 0, 
Vot^. Hence, 


r uei-injcL LLun puLiLy jur junnunct^ t:nntuiL t:rnfC!u in 


i 


A + Va„{i) - Ka.(O) > A - il~an)Va„(0). (11) 

Now, as in §3, Va„(0) < \R{0, 1)1/(1 -Q;„).If R(0, 1) < 0, then Va„(0) = 0 and we 
are done. So, let R(0, 1) > 0. Then, since -14,, (0) > -R(0, 1)/(1 - a„), we have 
from (11), 


A + V«„(0-V«„(0)>A-i?(0, 1) 

= A — (A + B)p\ + B 
= A(1 - pi) + 5(1 - pi) > 0. 

Further, for pa„ (0 > 0, we need denominator to be positive, and this follows imme¬ 
diately from above. Now, pa,, (/) p(i), Vt, as o(n t where p{i) is the RHS of (9). 

Hence, 0 < p(i) < 1, VT It is hence, a probability. 

(5) Let us consider the cases 5 = 0 and A = 0 separately. 5 = 0 corresponds to ignoring 

fuel loss which is reflected from our optimal policy, since then, p(z) = 0 and so, 
it is optimal to keep sending fuel for ever. This is what one expects from intuition, 
for maximizing power. The case A = 0 corresponds to ignoring power output of 
the engine. Note that Pa„(0 = 5/(5 + (0 — 14„(0))> iti this case. However, 

5(0, 1) = 5pi - 5 = 5(pi - 1) < 0. Hence, 14„(0 = 14„(0) = 0, as mentioned 
in point (3) above. So, Pa„ii) = L Vi, n, and hence p(i) = 1, Vi, again as expected, 
since it tells us that ‘never to send fuel’ policy is optimal. 

(6) Finally, we look at the problem of explicitly computing g and h(i). By assumption 

A(3), we have 0< 1-pi < 1-P2< _Consider a new process with identical 

state and action spaces and identical rewards, but with transition probabilities given 

by. 


p(i, i 4-1, 1) = 


Pi+\ 

P\ 


p{i, 0, 1) = 1 - 


Pi+l 

P\ 


Pi - Pi+l 
PI 


and p(i, 0, 0) = 1, Vi > 0. Let Vii) be the pi-discount optimal value function for 
this new process, where p\ is treated as a discount factor. The analysis of (Ross 1983), 
pp. 98-99, carries through now, and we obtain 


g = (1 - pi)y(O), h(i) = V{i) - V{0). 


The long-run average return optimal policy is one which selects action 0, the first time 
i such that Pj +1 < 5/(A + 5 + (VO' + 1) - V(0))). This completes our discussion 
of the results. 


5. Concluding remarks 

(1) We considered the problem of controlling the supply of fuel in a fuel injection internal 
combustion engine system, with the twin purposes of maximizing power output and 
minimizmg fuel wastage under very general assumptions. 
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(2) We formulated the problem as an infinite state controlled Markov chain on the state 
space of nonnegative integers using binary control variables, and applied dynamic 
programming techniques to obtain the long-run average reward optimal policy. The 
optimal policy turned out to be a threshold policy which cuts supply of fuel at a certain 
state i mentioned in theorem 1. As a result if i < oo, the overall Markov chain under 
this policy becomes finite state with state space {0, 1,2,..., / 

(3) We fully analysed the optimal policy. Our analysis showed that the optimal policy 
actually gives the threshold probability. We also considered cases in which we reduce 
the problem either to one of maximizing power output alone or to one in which just 
fuel wastage is minimized, and found that in either case the threshold policy is what 
one expects from intuition. 

(4) Finally, we looked at the problem of explicitly computing the long-run average reward 
and the threshold probability. We obtained expressions for both of them in terms 
of Pi-discount optimal value function of a modified process, which can be easily 
computed. Further investigation could be carried out for the experimental verification 
of the algorithm. 


SB is indebted to Prof Vivek S Borkar for suggesting the approach to the problem and for 
many helpful discussions during the course of this work. 
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4 variational formulation-based edge focussing algorithm 
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Abstract. Many edge detection techniques exhibit scale-dependent distortion 
of edges. We develop two ideas, which may also be of independent interest, to 
produce sharp edge localization at all scales. The first is approximation of the 
functional associated with the variational formulation via epi-convergence, re¬ 
placing the edge set with a function. We provide a fast algorithm for minimizing 
the approximate functional. The second is to scale parameters and data to focus 
the edges. The resulting edge detector is a singular perturbation of a coupled 
pair of partial differential equations, yielding an elegant structure, suitable for 
digital or analog parallel implementation on mesh-connected arrays. 

Keywords. Edge detection; parallel implementation; mesh-connected arrays. 


1. Introduction 

The notion of ‘scale’, scale of features and scale of representation, is widely held to be 
of fundamental importance in vision. One reason for this is that hierarchical descriptions 
offer potential reductions in complexity of various visual processing problems. Coarse 
scale segmentation of an image, for example, can be used to identify regions of interest 
for further processing, thereby reducing the computational load. It is important, there¬ 
fore, that coarse scale descriptions retain those features of the data that are required 
for effective decision making. In the case of edge detection, T-junctions and comers 
play important roles in estimating the depth and shape of objects in a scene (Gamble & 
Poggio 1987). It is desirable, therefore, to accurately represent these features even at coarse 
scales. 

Coarse scale edges have the advantage of reduced complexity; fine scale edges are 
more convoluted but yield more accurate and detailed information. A problem many 
edge-detection algorithms exhibit is systematic distortion of high curvature edges, such 
as T-junctions and comers; this distortion is aggravated by operating on coarse scales. 

rrnrithmc wViir.b stTiooth tiic data iu 3- scalc-dependeut way 
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& Hildreth 1980) and the Canny (1986) edge detector are examples of this type. More 
recent approaches to edge detection combat the distortion of high curvature edges by in¬ 
troducing interaction between image smoothing and edge placement. Examples are the 
Markov random field formulations (Geman & Geman 1984; Marroquin 1985; Bilbro et al 
1992) and the related variational formulation of edge detection (Mumford & Shah 1985, 
1989; Blake & Zisserman 1987). There is evidence (Blake & Zisserman 1987) that edge- 
detection techniques of this type do exhibit smaller localisation distortion than those of 
the first class. Problems remain, however; distortions still occur, and, as before, the degree 
of distortion depends on the scale of the edge detector. Coarse scale edges exhibit greater 
distortion. 

Edge focussing aims at improving the localization of coarse scale edges without in¬ 
troducing fine scale edges. In this paper, we have derived an algorithm for performing 
edge focussing by starting with a variational formulation of an edge-detection problem. 
The resulting algorithm is described by a coupled set of nonlinear second order parabolic 
partial differential equations ((5)-(9) below) with explicit parameters ^ and c which are 
appropriate-adjusted (see (10)-(12)). The adjustment induces focussing of the edges. The 
global coarse scale nature of the edges is retained by introducing scale-stabilizing feedback 
mechanisms. The adjustment process commences after the nonlinear parabolic equations 
have neaily converged to their equilibrium. The set of equations (5)-(9) and (10)-(12) 
should really be viewed as an adaptive nonlinear filter which performs edge detection via 
focussing. Indeed, the equations are the fundamental objects in this theory and, apparently, 
are far more well behaved (for example, convergence to global minima) than the original 
variational problem. 

The foundation and motivation of the adjustment of the parameters lies in certain limit 
theorems for the variational formulation proved by one of us (Richardson 1990, 1992). 
These theorems are discussed in § 3. An outline of the variational formulation appears in 
§ 2. The algorithm is developed in a continuum setting in § 4, and refined and discretized 
in § 5. Simulation results can be found in § 6. 

The work presented in this paper is similar in spirit to that of Bergholm (1987) who 
focussed edges produced by the Marr-Hildreth edge detector. Since the variational formu¬ 
lation has better localization than the Marr-Hildreth edge detector, our approach requires 
less drastic adjustment of the edges. 

The edge-focussing algorithm is not the only point of interest in this paper. The vari¬ 
ational formulation is a mathematical model, not an algorithm. The primary difficulty in 
constructing an algorithm is appropriately representing the edges. One approach is to ab¬ 
sorb the edges into the interaction between neighbouring pixels. This idea appears in the 
anisotropic diffusion approach (Nordstrom 1990; Perona & Malik 1990), GNC (Graduated 
non-convexity) type algorithms (Blake & Zisserman 1987), and in mean field annealing 
(Geiger & Yuille 1989; Bilbro et al 1992). There is another approach, which we adopt in 
this paper, that has been developed within the framework of approximation (of variational 
principles) via F-convergence or epi-convergence (De Giorgi & Franzoni 1979; Attouch 
1984). This theory has been successfully applied to the variational formulation of edge 
detection by Ambrosio & Tortorelli (1990, 1992). Some computational work based on 
these results has appeared (March 1988, 1989, 1992). These approximations, and some 
further variations, are nresented in 8 9 tThp vnrintinnc 


of the robustness of the approach.) 

The approximation is achieved by refommlating the variational problem. The edge set 
is replaced with a function that modulates the smoothing of the image. The variational 
principle forces the function to have the appearance of a smoothed neighbourhood of the 
corresponding edge set. The degree of smoothing, i.e. the width of the effective edges, 
is controlled by a parameter. The ‘convergence’ of the approximation occurs by taking 
the parameter to the appropriate limit where the width of the effective edges tends to 
zero. One significant advantage of this reformulation is that the ‘edge’ function can be 
discretized in a straightforward way and minimization can proceed via the solution of 
discretized partial differential equations. In particular, the parabolic partial differential 
equations (5)-(9) comprise a ‘gradient’ descent on the approximate functional. Actually, 
in our implementation, we vary the step size of the ‘gradient’ descent and approximate a 
Newton-type descent (see (16)). By doing this we achieve very fast rates of convergence. 
Thus, we obtain a fast and elegant algorithm. 

Having a separate function to represent the edges, as opposed to absorbing them into 
the interactions between image pixels, has some advantages. Machine Vision researchers 
are interested in combining various low-level vision processes, intensity edge detection 
and stereo depth edge detection for example, into a single operation. Having an edge 
representation such as the one we employ may greatly facilitate this. 

It turns out that the fonn and the properties of the epi-convergent approximation mesh 
well with the parameter adjustment proposed for the edge-focussing algorithm. In partic¬ 
ular, one can argue heuristic ally, and demonstrate computationally, that with ‘wide’ edges 
some edge distortions are relaxed. The price paid for this is a drop in resolution of the edges. 
The adjustment attempts to produce the best of both worlds, relaxing the edges initially 
and then sharpening them as the parameters associated with the variational formulation 
are adjusted for finer scales. 

2. The variational formulation 

Mumford & Shah (1985,1989) suggested performing edge detection by minimizing func¬ 
tionals of the form 

£(/, r) = fi [ (/ - g)^dn + [ IVfl^dn + ain, 

Jq J^-r 

where is the image domain (a rectangle), /x is Lebesgue measure, g is the observed grey 
level image, i.e. a real valued function, F denotes the set of edges, IF | is the length of F, 
and and a are real positive scalars. This approach is a modification of one due to Geman 
& Geman (1984) using Markov random fields, which was developed by Marroquin (1985) 
and by Blake & Zisserman (1987). 

The three terms of E ‘compete’ to determine the set F and the function /. The first 
term penalizes infidefity of f to the data, while the second term forces smoothness of the 
approximation /, except on the edge set F. Thus, / is a piecewise smooth approximation to 
g. The third term forces some conservativeness in the use of edges by penalizing their total 
length. Roughly speaking, if g has a step discontinuity, approximating g with a smooth / 


causes the first term to be large, tracking g more closely with a less smooth / increases 
the second term, and allowing T to approximate the support of the discontinuity reduces 
both the first and the second terms at some cost to the third. 

This formulation was motivated in part by the desire to combine the processes of edge 
placement and image smoothing. Earlier edge-detection techniques such as the Marr- 
Hildreth edge detector, the Canny edge detector, and their variants separated these pro¬ 
cesses; the image is first smoothed, to suppress noise and control the scale, and edges are 
detected subsequently, as gradient maxima, for example. A consequence of this two-step 
approach is pronounced distortion of the edges, especially at high curvature locations. 
Comers tend to retract and be smoothed out; the connectedness of the edges at T-junctions 
is lost. The ‘finger-print’ images of gradient maxima of one dimensional images in scale 
space (Witkin 1983) are well known; the localization of edges degrades badly as scale 
increases. Many two-dimensional examples can be found in the literature. By introducing 
interaction between the edge placement and the smoothing it was expected that this ef¬ 
fect could be abated. There is evidence, both theoretical, in one dimension (see, Blake & 
Zisserman 1987), and experimental, in two dimensions, that this is indeed the case. 

2.1 Approximation and computation 

To compute minimizers of E the critical question is how to represent the set P. A natural 
approach is to discretize P into “edge elements” and treat them combinatorially, adding or 
removing elements in an attempt to minimize E. Appending a stochastic component leads 
to the simulated annealing approach first suggested by Geman & Geman (1984). This tends 
to produce computationally impractical algorithms. Modifications which incorporate the 
edge elements into the interaction between image pixels have been proposed. One of these 
is based on mean field approximations of the Markov random field (Geiger & Girosi 1989; 
Bilbro etal 1992) and another, GNC (Blake & Zisserman 1987), is based on a homotopy 
of the interactions. Both these approaches have their strong points, and are in fact quite 
similar (Geiger & Yuille 1989; Bilbro et al 1992). A novel approach has appeared from 
the mathematical theory of approximation of functionals via P-convergence, also known 
as epi-convergence (De Georgi & Franzoni 1979; Attouch 1984). We will use the later 
terminology to avoid confusion. We will not give a general definition of epi-convergence 
and refer interested readers to the references cited and also to Ambrosio & Tortorelli 
(1990, 1992). The definition of epi-convergence is designed to allow approximation of 
one variational principle by another. We consider functionals of the form 

Ec= f + 

+ a(cvp(u)lVu|2 -f (1 - vf/4c)]dp.. (1) 

Here <t>(u) takes the role of the P in E, i.e., it modulates the smoothness constraint on /. 
The other terms involving v force <P(v) to simulate the effect that P has in E. Implicitly we 
have 0 < u < 1. The algorithmic intention is to minimize Ec with respect to / and v. An 
obvious advantage the approximation offers over the original formulation is that v, since 
it is a function on ^2, can be discretized in a straightforward way and (local) minimizers of 



t if one sets 


cl>(i;) = u^, and = 1, (2) 

n Ec epi-converges to £ as c -> 0. Some computational results for this functional have 
iady appeared (March 1989) (see also Shah 1991), and a scheme similar to the one 
sented here has been applied to the stereo-matching problem by March (1988). 

Fhe choice for O and ^ given above may be one of the simplest possible but it is not 
que. The first functional which was shown to be epi-convergent to E (by Ambrosio 
Tortorelli 1990) has the form of (1) with the formal substitutions 0(u) = cfi^(u) = 
- (1 — i?)^)^^ . When one considers algorithms based on these functionals there ai'e 

ie-offs to be made between speed and performance. For example, the choice reflected in 
leads to simple equations and fast computation. However, the more complicated choice 
ntioned above produces sharper singularities in <I> and hence less smearing of / near 
edges. With slight modifications, the proof of epi-convergence found by Ambrosio 
Tortorelli (1990, 1992) can be made to go through for a large class of 'T and O. In 
ticular, one can choose to be any function satisfying 

fl^(jc) > 0, forjc € (0, 1], 

2 / (1 -M)vI/^/^(w)dM = 1. 

Jo 

te that any function satisfying the first property can be made to satisfy the second 
)perty by suitable normalization. Given such a 'F one can choose to be any 
iction satisfying 

^( 1 ) = 1 , 

<D(0) = 0, 

0(x) e (0,1) forx € (0, 1). 

though the conditions given above are sufficient for the proof of epi-convergence, for 
[orithms based on ‘gradient’ descent one should also impose the condition that be 
motonically non-decreasing and O be monotonically increasing on (0, 1). Furthermore, 

‘ our implementation, which is discussed in §5, the condition lim;c-^.o ^(x)/^ = 0 
3uld be imposed. Even more general and d> than defined above are possible. For 
ample, setting 

vF(i;) = <I)(i;) = (3) 

o produces an epi-convergent set of functionals. Examples in the class defined above 

(m-f l)^(m + 2)^ ... 

<p(v) = v^ and ^f(v) = - - - , (4) 

lere m > 0 and n > 0. Equation (2) is a special case with (n, m) = (1,0). 

Suppose £(/, F) < 00 . The proof of epi-convergence involves basically two 
;ps. The first is to show that for any sequence {/q, Uq} where c, 0, fa -> /, 


and Vcj 1 in an appropriate sense (not pointwise), that liminf/_^oo Eciifa, Vci) > 
E(f, F). The second is to construct a sequence such that limsupj_^oo £c, (/c/. < 

E(f, F). If (/, F) minimizes E then this second step requires constructing near minimiz- 
ers of Ec- If liminf/-»co £c/(/c, i ^c,-) < oo then, roughly speaking, if x e F one has 
hm/_»oo = 0- (Thus at these points we do not have Uq(x) —> 1, however, 

F is a set of /li measure 0.) On the other hand, the last term in (1) forces Vc(x) to con¬ 
verge to 1 for almost all x € Q (in the sense of Lebesgue measure) and hence one has 
limc-H-o ^>(uc(x)) = 1 for almost all x e ^2. The near minimizers of Ec are constructed by 
setting <J)(uc(x)) ~ 0 on F and 4>(i)c(.^)) — 1 outside some neighbourhood of F with a 
smooth transition in between. The approximations indicated here become equalities in the 
limit as c -> 0. The width of the transition depends on and on c. We give a brief heuristic 
description of how this occurs. In the transition region we expect / to be relatively smooth 
so only the terms not involving / in Ec will have a significant effect on the form of v there. 
In the following inequality, 

c'k(i;)lVu|2 4- - ~ 

4c 

the equality holds only if |Vi)l = 'k“^/-(u)[(l — u)/2c]. This suggests that (in one di¬ 
mension) if Uc(t) satisfies duc(t)/dt — [(1 — Uc(t))/^c]^~^^^(uc(0) with d>(M(0)) ~ 0 
then setting i)(x) = M(:(dist (x, F)), for dist (x, F)) < Zc where d>(Mc('tc)) — 1 (with 
Mc('tc) ^ 1 as c 0), will produce near optimal transitions. This is how the near optimal 
Vc are constructed by Ambrosio & Tortorelli (1990, 1992). Note that assuming that udO) 
does not depend on c we obtain udt) = u\it/c). Thus the edge width is proportional to 
c. Let y(s) = /q (1 — r)'h^/^(r)dr. We now compute 


L 


Xc 


C'T(M(t)) 


duc(t) 


~-L 

= 1 /. 


dt 

u(t)) 




ducit) 


dt 


Tc a 

—y{u(t))dt 
0 at 


= yM ^ - 


4c 


(1 — u{t ))) dr 
1 


with the last approximate equality becoming equality in the limit as c —> 0. In the one¬ 
dimensional case we now see that the last term in (1) will contribute approximately a times 
the number of discontinuity points of /. In two dimensions one obtains approximately a 
times the length of jF]. Thus, we see that Ec approximates E. 

Our edge-focussing algorithm will be implemented as a singular perturbation of a de¬ 
scent on a discrete version of Ec- We will briefly consider the continuum equivalent. 
Define 


dfEc = ^(/ - g) - V . (O(u)V/), (5) 

dyEc = ^(v)a-^\Vf\^ - cV • (4/(u)Vi;) 

-t-2cvL(u)!Vul^ -F (1 - u)/2c, (6) 

= - 2c'T(u)Au) - c^(u)|Vu|^ + (1 - v)/2c. (7) 
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The Euler-Lagrange equations for / and v are, respectively, dpEc = 0 and d/Ec = 0 with 
Neumann boundary conditions on both v and /. Allowing / and v to depend on t we can 
write a ‘gradient’ descent on E^ in the form, 

~f(x,t) = -CfdfE, (8) 

9 

— v(x,t) = -CydfE, (9) 

at 

where Cf and Cy control the rates of descent; they would be constant for a strict gradient 
descent, but may not be in general. In our implementation Cy is not constant. Since the 
functional Ey is not jointly convex in v and / we do not expect to always reach a global 
minimum by a descent method. Thus the solution obtained will depend on the initial 
conditions and also on the parameters cf and Cy. 

Equation (8) strongly resembles the anisotropic diffusion scheme of Perona & Malik 
(1990) and, even more strongly, the ‘biased’ anisotropic diffusion scheme of Nordstrom 
(1990). Perona & Malik (1990) begin by considering the standard diffusion equation, 

^f(x,t)=^Axf{x,t), f(x,0) = g(x), 

at 

which produces the ‘scale-space’ smoothings of g, parametrized by t. For fixed r one 
obtains f(x, t) by convolving g(x) with a Gaussian kernel whose variance is linear in t. 
Perona & Malik (1990) suggested controlling the diffusion coefficient to prevent smoothing 
across edges. Thus they were led to consider equations of the form 

~J(x, 0 = V;, . (h(\V,f(x, t)\mf(x, t)), fix, 0) = gix). 
at 

They experimented with h(s) = 7/[l 4- (s/K)^] and h(s) = , where J and K 

are constants. Equation (8) resembles this equation in that it is a diffusion with controlled 
conductivity. The control of the conductivity depends on |V/| indirectly through (9). The 
term fif — g) in (8) stabilizes the solution at some particular scale. Such a term also 
appears in the ‘biased’ anisotropic diffusion scheme studied by Nordstrom (1990). Perona 
& Malik (1990) analyse their scheme to show that the maximum principle holds, i.e. that the 
solution’s extrema never exceed those of the original image. They argue that this implies 
that no new ‘features’ (blobs) are introduced into the solution. Here, as in Nordstrom 
(1990), this property is a trivial consequence of the formulation. The functionals Ey would 
increase if such new features appeared. (Truncating such a new feature would decrease 
Ey.) An advantage of the scheme presented here is that it yields an explicit representation of 
the edges (via the function 0 (u)). The resulting system of equations admits a particularly 
simple implementation in digital mesh connected parallel machines with simple processors 
or, potentially, in an analog network, such as discussed by Harris et al (1989). 

3. Scale, noise, and accuracy 

Witkin (1983) introduced the idea of a scale-space representation of an image - smoothing 
the image and identifying edges on all scales. One of the problems arising from the scale- 
space concept was the correspondence problem: Which of the fine scale edges correspond to 
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Non-minimal Geometries Conesponding Mininial Geometries Figure 1. Calculus of variations results. 

the coarse scale ones? The problem is aggravated by the fact that the distortion of the edges, 
mentioned earlier, depends on scale. Typically ‘coarse scale’ implies more smoothing and, 
hence, more distortion. This is undesirable in many situations because salient features, 
comers and T-junctions for example, tend to be obscured. The correspondence problem is 
therefore of great importance. The edge-focussing algorithm of Bergholm (1987) addresses 
the correspondence problem by taking small steps in scale. We address it this way too, in 
effect, but also by using a better underlying edge detection algorithm. 

Since the variational approach combats the distortion caused by smoothing one hopes 
that the correspondence problem will be alleviated by using it. Although this appears to be 
the case, problems remain; there still are distortions, depending on scale, and the model 
intrinsically restricts the geometry of possible edge sets in an unnatural way. The analysis 
of Mumford & Shah (1985) showed that edge sets produced by the variational approach 
have the following properties, which are illustrated in figure 1. 

If r is composed of arcs then 

• at the most three arcs can meet at a single point and they do so at 120°, 

• they meet 3^ only at an angle of 90°, 

• it never occurs that exactly two arcs meet at a point (other than the degenerate case of 
two arcs meeting at 180°), i.e., there are no comers. 

These results derive firom the fact that the term | F | in £ locally dominates the behaviour 
of singularities in F. Hence the types of singularities observed are identical to those of 
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lirnal surfaces. Of course ‘real’ edges are not restricted to these geometries. The de- 
dence on scale derives from the interaction between the singularities of / and those of 
R.oughly speaking, smaller values of ^ produce greater distortion. This becomes more 
IT in light of the theorems quoted below and the analysis presented by Mumford & Shah 
89). 

n view of the foregoing it is natural to ask what are the noise/accuracy/scale tradeoffs 
i, more generally, whether scale-dependent distortions are necessary. Is it necessary that 
mechanisms used to combat noise, i.e. smoothing, be tied simultaneously to scale and 
uracy? We propose that the accuracy of edge localization need not be limited by scale, 
j central ideas of the method we will develop can be gleaned from some asymptotic 
orems proved by one of us (Richardson 1990, 1992) concerning minimizers of E. We 
1 now present a slightly simplified statement of those theorems, 
fo measure the disparity between one edge set and another we introduce the Hausdorff 
trie. For A C IR" the e-neighbourhood of A will be denoted by [A]e and is defined by 
le = {;c 6 iR“ : inf 3 , 6/1 1 |x — y 11 <6} where 1| • || denotes the Euclidean norm. Denoted 
•), the Hausdorff metric is evaluated by 

dniA, B) = inf{€ : A c [B]^ and B C [A^}. 

mentary analysis shows that is a metric on the space of non-empty compact sets 

Suppose for the moment that we have ideal data: g is a piecewise smooth function, 
make this clear we denote it by g/. We assume that there exists a set Fg, a union 
curves, satisfying length (Eg) < oo such that g/ is discontinuous on Eg and smooth 
ewhere. More precisely we require that |Vg/pdjU. < oo, that there exists a 
istant L such that g/, restricted to any straight line segment lying in — Eg, is a Lipschitz 
iction with Lipschitz constant L, and that g actually has a discontinuity everywhere on 
except, possibly, for a set having zero total length. Under these conditions the following 
ids. 

leorem 1. For any fixed o; > 0 and e > 0 there exists < oo such that if and 

is minimal for E then 

r^g) < 6. 


e thrust of this theorem is that as oo the set E^ can assume arbitrary geometries 
1 will coincide with the discontinuity set of g/. The theorem can be interpreted as 
asymptotic fidelity result for the variational approach. It implies that the distortions 
ulting from ad hoc functions of this type are local, small-scale effects. 

From a practical point of view the result is lacking because fi oo forces / to match g; 
ictly; noise in g/ will result in the appearance of many spurious boundaries. However, 

; theorem can be extended to incorporate noise and smearing effects. Roughly speaking, 
the admissible noise magnitude scales as and the admissible smearing acts 

er a radius of then their presence can be tolerated and the theorem still holds. To 

^e a more precise statement of this, let hr and hx be any positive real valued functions 


saiisiying 


lim ^hr(P) = 0, 

)3-»oo 

lim = 0, 

and let ^(jS) be the set of all functions g which can be written as 


g = Srigi) +X0), 


where 0 < A. < /zx(^) is a scalar, co e L°°(Q) with [toloo < 1, ‘S,- is any smearing operator 
satisfying 


Sr fix) e 


min /(y), max f(y) 

\_y€Br{x) yeBrix) 


where Br(x) is the disc of radius r centred at x with 0 < r < hr(f), and gi is the ideal 
data described earlier. We then have the following. 


Theorem 2. For any fixed a > 0 and e > 0 there exists p* < oo such that if P > P* and 
r ^ is minimal for E for some g e'^^iP) then 

dniXg, V^) < €. 

The proofs are beyond the scope of this paper (they require a lengthy function analytic 
development) and have appeared in Richardson (1990, 1992). The relevant mathemat¬ 
ical framework is outlined in Ambrosio (1989). This theorem indicates how noise and 
localization defects should scale with the parameter p to maintain fidelity. 

Remark. The requirement \imp.^oo P^^^hxiP) — 0 seems to be necessary in general. 
There are certain images which asymptotically produce spurious edges for the piecewise 
constant version of the variational formulation if this condition is violated (Richardson 
1990, 1992). 

Our immediate goal, fulfilled in the next section, is to describe a heuristic which mimics 
the theorems to produce accurate localization of the edges in a manner independent of 
scale. Two problems immediately suggest themselves. First, a real image has fixed noise 
which cannot be scaled since it cannot be identified, and secondly, smearing is fixed and 
cannot (in general) be removed. The intrinsic noise and smearing of an image limits the 
recoverable accuracy of the edge locations. We are not proposing a scheme to eliminate that. 
Edge detection, ostensibly, will be performed on different scales. As we have indicated, 
operating on coarse scales tends to introduce distortions above and beyond those inherent 
in the signal. Our goal is to eliminate those distortions and recover, even on coarse scales, 
the same accuracy of localization usually achieved only at fine scales. 

Since we intend to use an approximation to the variational formulation, it is prudent 
to consider whether the approximation deviates in a significant way from the original 
formulation with regard to distortion of edges. Although analysis is prohibitively difficult, 
we expect (and simulations have borne this out) that the spreading of the edges in the epi- 
convergent approximation actually ameliorates some of the geometric distortion. We recall 


that the primaiy reason for the geometric distortion is that the term | F [ in determines 
the structure of the singularities. Roughly speaking, this arises because the length term is 
one-dimensional and scales linearly while the other terms are two-dimensional integrals 
and hence scale quadratically in the size of the domain. (Actually this is not precisely 
true because singularities arise in /, but the dominance of the length term still occurs at 
singularities in F.) When the edges are smeared and length is replaced by a two-dimensional 
integral the concentration of cost in the length term is alleviated and, hence, we expect 
the distortion to be relaxed. The price paid for this is the lack of resolution of the edges. 
The algorithm begins with thick edges, thus relaxing distortion, but ends by sharpening 
the edges while scaling the parameters in accordance with theorem 2. Thus the edges are 
focussed as resolution increases. 


4. Edge focussing via scaling 

In this section we describe in detail the modifications to the descent equations, (8) and 
(9), which we introduce to focus edges. The essential idea is to satisfy the conditions of 
theorem 2 by smoothing g in a controlled way while allowing oo. Simultaneously we 

allow c 0 to sharpen the edges. We draw an analogy between the width of the edges and 
the diameter of the smearing operators appearing in theorem 2. Thus the rates of change 
of ^ and c are coupled. 

We consider introducing dynamics into quantities yS, g, and c, which in a standard 
minimization of Ec would be held fixed. These dynamics are intended to come into effect 
only after the basic descent equations (4) and (5) have essentially converged. Thus we 
assume g(x, 0) is the initial data and /(x, 0) and u(x, 0) satisfy their respective Euler- 
Lagrange equations with = y3(0) and g(x) = g(x,0). This implies the presence of 
a nominal set of edges, i.e. a function u(x, 0). We will be guided by the heuristic that 
the subsequent focussing should only focus the edges already found and not introduce 
new ones. We make the following correspondences with the quantities which appear in 
theorem 2, 

g(x, t) <—> Sr{t){gi) + X(t)w{x, 0, 

f(x, t) < - Sr{t){gl), 

where r(t) — jfirc(r) for some constant 

l^(01oo = lg(x, 0 -/(^, Oloo, 

and |w;(x,r)|oo = 1- 

We will discuss the choice of K and the meaning of the correspondences shortly. Consider 
first the following equations, 

9 

y, t) = €{f{x, y, t) - g(x, y, t)), 
at 

where € is a small positive constant included to reflect the fact that these equations are 
perturbations of (8) and (9). We observe that we obtain a solution to these equations such 




fix, t)~fix,0), 

^(0 = ^(0)e^^ 

six, t) ~ g(x, 0) + fix, 0)(1 - 

In fact, if (/(x,0), i>(x,0)) minimizes Ec with data gix, 0) and parameter ^(0) then 
it is easy to see that (/(x, r), d(x, ?)) minimizes Ec with data g(x, t) and parameter 
j3f)- These equations show that with this scaling we have y3(0 -> oo while the minimal 
solutions vix,t) and fix,t) remain fixed. Interpreted in view of theorem 2 this means 
that gi con'esponds to fix, 0) (i.e. K — 0). Note that according to our correspondences 
we have kit) a f~^it), so the scaling conditions of theorem 2 are satisfied. We will now 
alter these equations slightly. First, we will introduce some dynamics into c to sharpen the 
edges. Second, we suppress the smoothing of g in a neighbourhood of the edges which 
shrinks with time; this is to permit focussing of the edges. Thus we consider 

9 

—gix,t) = 6pivix,t))ifix,t)-gix,t)}, (10) 

at 

= ( 11 ) 

at 

Q 

—cit) = -6cit), (12) 

at 

where pivix, t)) should be approximately zero inside some neighbourhood of the edges 
and approximately one outside some larger neighbourhood. Furthermore, the widtli of 
the larger neighbourhood should shrink as f~^it). A simple and reasonable choice, for 
example, is p = d) since in this case the neighbourhood width is proportional to c(t), 
which in turn is proportional to it). The algorithm takes the form of (8) and (9) until a 
local minima is reached, and subsequently (10)-(12) come into effect. Assuming gix, t) 
and fix,t) converge, we interpret the limit gix, oo) = fix, oo) as corresponding to g; 
and the set 5^ = {x : 0(u(x, oo)) ~ 0} as corresponding to F^. The quantity K does not 
appear in our equations and is meant only to facilitate the correspondences; it should be 
interpreted as being sufficiently large so that the set {x : pivix, t)) < 1 — 5) for some 
small positive constant 5 is contained in [5'g]r(f). We expect that fix,t):^ fix, 0) for all 
^ [5g]r(0)- Given this, the correspondence of fix, t) with ^r{t)igi) is consistent with 
the noise scaling of theorem 2. 

In general the choice of p is a delicate issue. If g is noisy it may be desirable to allow more 
smoothing near the edges. The price for this is admitting potential distortion into the edges. 
In this situation a better smoothing mechanism might be a directionally controlled diffusion 
of g, suppressing diffusion across edges but enhancing diffusion parallel to the edge. This 
can easily be implemented within the framework developed here since Vd>(u(x)) will be 
perpendicular to an edge in a neighbourhood of that edge. (We have not experimented with 
this alternative.) Even when noise is not an issue p must be chosen carefully. We contend 
that the ideal choice should allow for edge correction and adequate smoothing without the 
introduction of edges from finer scales. 


ne conditions, is to allow fi to depend on x and to scale it only around the edges. For 
imple, one could replace (11) with {S/dt)fi(x, t) = €{1 - p(v(x, t)))fi(x, t) andelimi- 
e the smoothing of g, (10). We have experimented with this variation in our simulations 
1 the results are similar to those presented here. 


Discretization and parameter choices 


this section we address some of the issues which arise as a consequence of dis- 
tization and further refine the algorithm. The lattice spacing (we will consider only 
lare lattices) can be absorbed, via scaling, into the other parameters, /3, c, and a. 
propriate step sizes for the discrete versions of the descent algorithm must be 
ind, and the relative rates of the gradient descent and the scaling dynamics must be 
nded. 

For the simulations presented in this paper /, g and v are discretized in a manner 
scribed below. Discrete versions of / and g are defined on a rectangular subset of a 
tare lattice while the discrete version of v is defined on a similar subset of a square 
tice which is twice as dense and rotated by 45°. This is not necessary, but it facilitates 
; discrete implementation. We define the following subset of Z“ C 1R“, 

= j = \ . M}. 

i assume that g is defined on £/. The nearest neighbours of x e £/ are defined by 
Affix) = {x' € Z“ : |x' — x| = 1}. 
nilarly, we define 

£d = ((X“f‘X)/2’. x€£^,x g Aff (x) n df ), 

lich will support a, and the nearest neighbours 

U„(y) = {y' = (x + x')/2 ; x e £/, x's Uf(x), \y - y'\ = I/V 2 I. 

r discretization we can take the approach of discretizing Ec and then deriving discrete 
ler-Lagrange equations, or we can discretize the Euler-Lagrange equations directly. 
5 consider the first approach first. A discrete version of Ec with lattice spacing 8 is the 
lowing. 



+ x')/2)ifix)-fix')y 


x'€Affix) 


+“ E 

ye£v 






lere if x e £/ and x' e Aff ix)\Cf then we impose the condition fix') = fix) (this 
fines fix') uniquely), and if y G £v and y' G Afviy)\Cv then we impose the condition 
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v{y') = where y" e Mviy) is the unique point satisfying \y' — y”\ = 1. This is 


to ensure Neumann type boundary conditions in the descent equations given below. By 
making the substitutions 






a 


S~-a, 


we find that without loss of generality we can set 5 = 1, which we do henceforth. A 
discrete form of the Euler-Lagrange equations can now be found directly by differentiating 
Ed with respect to ufy) and f(x). To simplify the notation we will write ^(y), 0(y) 
instead of T'fuCy)), Ofufy)). For each y e the pair a', x' such that y = (a + x')/2 
and X e Cf,x' e Affix) D Cy is uniquely determined. Thus, for each such y we can 
set d/(y) = (fix) — fix'))^. The derivatives of ^ and <J> will be denoted 'k and 4> 
respectively. For each x e Cf and y g Cy we define 


dxEd = ^(f(x) - gix))-\- x')/2)if(x) - fix')) 

x'eMfix) 


(13) 


ByEd=ci ^^iy)dfiy)- 


1 - y(y) 


4c 


/e7Vi,(y) 


X (^^(y) + ^(y') + ^'k(y)(u(y) - u(yO)^ , 


(14) 


which are proportional to [d/dfix)]Ed and [d/dviy)]Ed respectively. These equations 
are discrete analogs of (5) and (6) respectively. (The constants in (14) are slightly different 
from those in (6) because the lattice spacing of Cy is 1 /\/2.) Now we consider discretizing 
the Euler-Lagrange equations directly. A discrete version of (7) is 

dyEd = (x~^<biy)dfiy) - — + c ^ (i!(y) - viy')) 

/eA/\,(.v) (-15^ 

1 


X ( 24^(y) - -4/(y)(u(y) - j;(y')) 


A third alternative is to average (6) and (7) and discretize, or equivalently, to average (14) 
and (15). If we do so, we obtain 


dyEd = a ^^iy)dfiy)- 


1 - u(y) 


4c 


+ c (^(3^) - ^^(y)) (l'J^(y) + ^^(yO) 


y'&Mviy) 

From the point of view of implementation this form is much more attractive since it 
simplifies computation. Finally, one could approximate 3/24/(y) + 1/24^ (yO ~ 24'(y) 
to obtain an even simpler form. Although this deviates from the theory, we use it in our 
implementation. 

Allowing all quantities to depend on t, our basic descent equations take the form. 


fix, t + 1) - fix, t) = -CfdxEd, 
u(x, t 4-1) - vix, t) = -Cyd^,Ed, 
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Pixel 

Figure 2. Graphs of minimal <t>(D(25, j + 0.5)) for step edge for c = 2.0, l.O, 0.5, 0.2. In 
each case a ~ 0.05, ^ = 2.0, and <t> and were cho.sen as in (2). 


lere on the right hand side all quantities are evaluated at time t. We now address the 
estion of choosing c/ and Cy. A standard gradient descent would have both Cf and 
constant. If we try to set Cy constant then it must be chosen small since {f{x) — 
x'))^/a can be quite large and hence convergence will be slow. A computationally effi- 
int choice that gives much faster convergence is to approximate a Newton-type descent. If 
; set 

1 / 4>(y) 1 X"' 

^v(y) = - -^ci/(y) -b - + 8cny) , 

2 \av(y) 4c j 



Figure 3. Graphs of minimal 4 >(d( 25, j + 0.5)) for step edge for a = 0.05, h — 2-0, 
and c = 0.5 where and 'I' are given by (4) with (n, m) given by (2,0) (2,1/2), and 
(2,1) respectively where increasing m increases the spread of the function. 




above, then we obtain 


^ l/4c- + 2c>i>(y)EyeAf.,o-)»'(y) 

<^(y)df{y)/(aviy)) + l/4c + ^c^(y) 

Our implementation employs this form of update. It has particularly attractive properties. 
Note, in particular, that u(y, r + 1) is an average of v(y, t) and a well-behaved quantity 
which lies between 0 and 1. Setting Cf constant is much less problematic, and for our 
simulations we have set it to (2^ ■+■ 8)“^ since this gives a good rate of convergence 
without allowing overshoot in /. The initialization of / and v will affect which local 
minimum is reached by the initial gradient descent. We expect this will have little effect on 
the edge-focussing part of the algorithm. In our simulations we have set /(O) = g(0) and 
v(0) = 1. With these choices we observe that the basic descent on / and v converges in 
about 30 iterations for the range of parameters we have experimented with. (Larger values 
of c and smaller values of will reduce the rate of convergence.) 

We now consider the discrete scaling dynamics. We recall that these equations come 
into effect only after the gradient descent has nearly converged. The following are discrete 
analogs of (10)-(12), 

g(x,r-l- l)=g{x,t) + €p(x,t){g(x,t) - f{x,t)), 

^(r + l) = (i 

c(t + 1) = (1 -6)c(t), 

where p and e are to be specified in each case. Approximations that save computation can 
be made here. For example one could compute this update once every n time steps instead of 
every time step and increase 6 appropriately. Also, since /(x, t) remains essentially fixed 
outside of some neighbourhood of the edges, one could restrict the update on ^ to some 
neighbourhood of the edges and completely drop the update on g producing essentially 
the same effect, as mentioned earlier. This would yield substantial time savings in serial 
implementations. Our implementation employs the equations given above. 

If c(t) becomes too small then the discrete approximation to Ec will break down. The 
value of c(0 can be used as a stopping criteria. For the choices of ^ and <I> used in this 
paper we allow c(f) to become small enough so that the effective edge width is one pixel. 
(Effective edge width can be defined as the width of the set {d> (y) < 1 /2} for example.) 

6. Simulation results 

For all of our simulation results we have scaled g so that g(x) € [0,1 ]. In particular, solid 
white is 1 and solid black is 0. In the two-dimensional plots and in the images we plotted 
(u) on the same mesh as /, i.e. the plotting mesh corresponds io Cf. For each x e £/ we 
plot the minimum value of 0(u) among the four nearest neighbours. (Note that an edge 
‘set’ could be defined as {y g ^ ^(^(y)) < 1/2.) 

The first two simulations illustrate basic properties of the epi-convergent approximation. 
Here g is defined on a 50 x 50 square mesh with g(i, j) = 0 for j < 25 and 1 otherwise. 
Figure 2 illustrates dependence of the edge smearing on c. We plot 0 (u(25, j -f 0.5)), j = 


i>(y, r + 1) = - 
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Figure 4. (a) Synthetic T-junction/corner data, (b) Minimal / with v = 1 with f = 0.1. 
(c) Minimal / with f = 0.1, ^ =0.1, and c = 2.0 (d) Minimal with f = 0.1, or = 0.1, 
and c — 2.0 (e) Minimal / after edge focussing with final value.s of /S = 1.0, a =0.1, and 
c = 0.2. (f) Minimal 4>(u) after edge focus.sing with final values of = 1.0, or = 0.1, and 
c = 0.2. 


.., 50 for c = 2.0, 1.0, 0.5, 0.2 after convergence of the descent equations (without 
Ling). In each case a = 0.05, ^ = 2.0, and O and 'I' were chosen according to (2). 
ure 3 is similar except that we have fixed c = 0.5 and varied and They are given 
(4) with (/ 2 , m) = (1, 0), (1, 1/2), (1, 1) respectively (increasing m tends to increase 
width of the edge). This is to illustrate how the shape of the edges can be changed by 
ring and 4^. 

0 demonstrate the behaviour of the edge focussing algorithm on high curvature edges 
have simulated the algorithm on the data presented in figures 4 a-f. The functions 
4^ are as in (2) and we set p(v) = <$>(u). We have carefully chosen the parameters 
nake the detection of the comer marginal. The initial values are /5(0) = 0.1, a = 0.1, 




























































Figure 5. (a-d) (Caption on facing page.) 


and c(0) = 2.0. The final values are ^{T) = 1.0 and c{T) = 0.2 (where T is the time of 
termination), and e was chosen so that 200 iterations with scaling are required. Figure 4a 
is the data set g. Figure 4b plots the optimal / when a = oo, i.e. no edges are allowed. 
This is to indicate the degree of smoothing associated with this value of p. Figure 4c 
shows / (0), i.e. the / obtained after allowing a descent to converge with the parameters 
held fixed at their initial values. Figure 4d shows d>(u(0)). We have chosen the parameters 
so that the three edges in the image are detected to different degrees. Figures 4e and 4f 
/(T) and <$>(v(T)) respectively. Note that the two larger edges and the comer they 
form is unambiguously detected while the smallest edge has been smooth out. The slight 
smoothing visible across the second largest edge is due to slight smoothing feedback at 
that edge. The function g(T) is not distinguishable from f(T) so we have not included it. 

Figures 5 and 6 demonstrate the algorithm on ‘real’ images. Figure 5 is 480 x 512 pixels 
and figure 6 is 512 x 512 pixels. In general € is chosen so that 200 iterations with scaling 
are required. The data are in figure (a) in each case. Each image has been processed for 
two different sets of parameters to indicate the stability of the edges under a change in 
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Figure 5. (a) Statue image (Paolina Borghese, Canova circa 1800) 480 x 512. Prescaling 
<!>(?;) (b), final 0(u), (c), and final / (d), with initial parameters f = 0.04, a = 0.01, 
c(0) = 1.0 and final parameters f = 0.2, c = 0.2. Prescaling cf>(u) (e) final (f), and 
final / (g), with initial parameters f = 0.1, a = 0.01, c(0) = 1.0 and final parameters 
fi = 0.5,c = 0.2. 

lie. The parameters a and f are an order of magnitude smaller in figure 6. This admits 
ich greater smoothing of the image. Even so, edge localisation is accurate. In both cases 
: displayed images are the following. Figures (b), (c), and (d) are <i>(i»(0)), /(T), and 
v{T)) respectively. Figures (e), (f), and (g) reiterate (b), (c), and (d) for the second set 
parameters iii each case. In figure 5 the first set of parameters is given by 

yS(0) = 0.1, O'= 0.25, c:(0) = 2.0, f{T) = l.0, dT) = 0.2, 

1 the second by 

y6(0) = 0.1, O'= 0.1, c(0)=2.0, ^(7) = 1.0, ciT) = 0.2. 

figure 6 the first set of parameters is given by 
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and the second by 


Figure 6. (a-d) (Caption on facing page.) 


0.004, a = 0.03, c'(0) = 2.0, 


c(T) = 0.2. 


We observe that as scale increases the set of edges detected increase monotonically — 
virtually no scale-dependent distortion is visible. 

This work was partially supported by the Office of Naval Research under contract 
NOOOl 4-77-0532, by the Air Force Office of Scientific Research under contract AFOSR- 
85-0227 and by the Army Research Office under contracts DAAG-29-84-K-005 and 
DAAL03-86-K-0171. Some of this work was done while TJR was at the Laboratory for 
Informations and Decision Systems, MIT. 
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Abstract. We consider here a linear programming problem whose rows of 
the constraint matrix can be partitioned into two parts. Such natural partitions 
exist in several special linear programs, including the assignment problem, the 
transportation problem, the generalized upper-bounded variable problem, the 
block diagonal linear program; and can also be used to induce sparsity pat¬ 
terns in Cholesky factorizations. In this paper, we propose a matrix partitioning 
method for interior point algorithms. The proposed method independently gen¬ 
erates Cholesky factorizations of each part, and reduces the complexity to that 
of solving generally, a dense linear system involving several rank one updates 
of the identity matrix. Here, we propose solving this linear system by an induc¬ 
tive use of the Sherman-Morrison-Woodbury formula. The proposed method 
is easily implemented on a vector, parallel machine as well as on a distributed 
system. Such partitioning methods have been popular in the context of the 
simplex method, where the special stmcture of the basis matrix is exploited. 

Keywords. Linear programming; interior point methods; Cholesky factor¬ 
izations; matrix partitioning methods; vector-parallel machines; distributed 
systems. 

Introduction 

5 consider here the linear programming problem 

min c^x 

Ax = b 

X >0, 

lere A is a m x n matrix of rank m, b is an m vector and c is an n vector. We assume 
It the matrix A has the partition 



Here G and H, respectively, are m i x « and m 2 x n matrices, and g and h are, respectively, 
and m 2 vectors, with mi -|-m 2 = m. We also assume that such a partition is determined 
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either by the structure of the problem or has been induced by sparsity considerations. 
Specially structured linear programs that have this natural partition include the assignment 
problem, the transportation problem, the generalized upper bounded variable problem and 
the block diagonal linear program. Such partitions may also be considered to arise in 
situations where rows are added, iteratively, to a linear program, and one of the set of rows 
G or H may be considered to be the ones added. 

Many variants of the simplex method exist which exploit the induced special structure of 
the basis matrix. We refer the reader to Lasdon (1970) for a background on these variants. 
Our goal, in this paper, is to consider the application of the recent interior point methods 
to this partitioned linear program. The study of these methods was started by the seminal 
work of Karmarkar (1984). Some notable papers that are related to this work are Barnes 
(1986), Kojima etal (1989) and Vanderbei etal{\9^6). The reader is encouraged to browse 
through these, the recent book by Fang & Puthenpura (1997), and many references therein, 
for an introduction to these methods. Our starting point in this paper is the implementation 
of these methods, which requires the solution of a linear system 

Cz = d (2) 

where C = ADA^ for some diagonal matrix D; and d, as a function of A, b, c and D, is 
determined by the particular interior point method employed. In this paper, we reduce (2) 
to solving an m 2 X m 2 system of the form, 

a-EE^)u = q, (3) 

where E is an m 2 x mi matrix. If Ej is the y th column of the matrix E, it can be readily 
seen that 

mi 

I-EE^ = I-^E;EJ, (4) 

and we view (4) as mi rank one updates to the identity matrix I. We solve the system 
(4) by an inductive version of the Sherman-Morrison-Woodbury formula. This inductive 
method requires 0(m2m^) multiplications, and is advantageous over inverting I — EE^ 
when m 2 > m\. Another advmtage of this method is its ready implementation on a vector 
and a parallel/distributed computing environment. 

In § 2 we present partial Cholesky factorization, the basic idea behind the partitioning 
technique; and, in § 3 we present a technique to handle several rank one updates. In § 4 we 
present the variant to handle the transportation and assignment problems, in § 5 the GUB 
and in § 6 the block diagonal linear program and the multicommodity flow problem. Finally 
in § 7, we give some preliminary computational results comparing the transportation variant 
with LOQO, Vanderbei (1992). 


2. Partial Cholesky factorization of ADA^ 

In this section we generate the general theory we will use to exploit the partition (1) of A. 
The main result we need for this nnmnse is thp fn11nu/in<r thpnrpm* 



uemxm lower triangular matrixl^ with positive diagonal entries, such that C = LL^. 
ise C is dense, — |m multiplications are required to compute L. 


^f. See George & Liu (1981). 

le matrix L in the above theorem is called the Cholesky factor of C. Given that 
sparse, we refer the reader to George & Liu (1981), an excellent reference on the 
lodology that preserves sparsity of L. We now use this theorem to exploit the partition 
if the rows of A. 


ssuming this partition, it can be easily seen that 


ADA^ 


GDG^ GDH^ 
HDG^ HDH^ 


( 5 ) 


len we can prove: 


ma 1 . There exist lower triangular mi x mi matrix Lj and m2 x m2 matrix L3 such 
GDG^ = LiLf andmm^ = 

Since A is full row rank, so are G and H. Since D has positive diagonal entries, 
and HDH^ are symmetric positive definite matrices, and we have our result from 
rem 1. 

^e now use the lower triangular matrices Li and L3, guaranteed by lemma 1, for solving 
vhen A has the partition (1). This is done in the next theorem, which generates a partial 
lesky factor of ADA^. 


orem 2 . Let the mi x mi matrix Li and m2 x m2 matrix L3 be defined as in the 
na 1 . Then, for the m2 x mi matrix L2 and m2 x m2 matrix D with 


D = L3(I - LJ^L2L^L"^)L[, 
LiLj =GDH^, 


Li 0 

L2 I 


■I 01 
0 D| 


( 6 ) 

( 7 ) 


ADA^=LDL^. 


Can be readily verified by a direct multiplication of L, D and . 
s a result of this theorem, the solution of (2) can be expressed in terms of the matrices 
L2, and L3. This is done in the next result, 
ssume that the partitions 


z = 



d = 


di 

d2 


:orm to the partition ( 5 ) of C (= ADA^). Here, di, zi are mi vectors and d2, Z 2 are 
sectors. 


orem 3 . Let the m2 x mi matrix E be defined so that L3E = L2. Equation ( 2 ) can 
olved by the following sequence of steps. 
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Step 1. Find by forward solve 
Liz'i=di. 

Step 2. Define 

Z^2=d2-L2z[. 

Step 3. Find z'^ by forward solve 
L3Z2 = Z2. 

Step 4. Solve 

(I - EE^)z'2" = 4 . (8) 

Step 5. Find Z 2 by backward solve 



Step 6. Find z\ by backward solve 
Lfzi = z\ -L 2 Z 2 . 

Proof. Using the structure of L and D (of theorem 2) the following equations can be readily 
derived 

LiZ| = di, 

Z 2 = d2 - L2Z|, 

DZ2 = Z2, 

L[zi =z[ -L2Z2. 

The theorem now follows from the structure (6) of D. 

The result of theorem 3 requires the Cholesky factors of GDG^ and HDH^, which are 
expected to be less dense than the factor of ADA^. The price for this enhanced sparsity 
is paid at step 4 of the theorem. Here, generally, a dense system of equations has to be 
solved. In the next section, we will present a procedure that solves this system in O (m ^ m 2 ) 
multiplications. If this system were solved by Cholesl;^ factorization, calculation of EE^ 
would require mim^ multiplications, and the calculation of the Cholesky factor another 
1^2 + 5^2 ~ 1^2- In the case m 2 > mi, solving directly is more expensive. 

For a dense matrix C, system (2) can be solved, using Cholesky factors, in ^m^ + 
^m^ — jtn multiplications. Using the technique of this section with a partitioned matnx 
A, it requires + 9m^ 4- 6 m\m 2 — 3m) multiplications, and is not competitive. 

We now establish a basic property of the matrix EE^, encountered in step 4. 

Theorem 4. Let E be defined as in theorem 3, step 4. The spectral radius of EE^ is less 
than 1. 

Proof Since ADA^ is positive definite, so is LDL^ (see theorem 2 for definitions), and 
thus D is positive definite. Using the structure of D, D is positive definite. Thus-, from (6), 
(I — EE^)z > 0 for all z. Thus z^EE^z < z^z for aU z, and we have our result. 



A method for a system with several rank-one updates 


lis section we develop an inductive version of the Sherman-Morrison-Woodbury 
lula for solving the system (8). For each j — 1,..., wzi, let Ey be the yth column of 
natrix E, and 

Emi+l — ’^ 2 - 

), let Bi = I, and for each k = 1,..., m i 
Bjt+i = B/t ~ , 

note that = I — EE^. 

ow, for each ^ = 1,..., mi + 1 and y = 1,..., mi + 1, define 
BfcEf > = Ej. 

s, for each k = I,... ,mi and j — k + I,... ,m\ + I, 


(Bt-EtE[)Ef+‘> = E;, 


(I-Ef>E[)Ej‘+‘* 



^sing the Morrison-Woodbury formula, we can write the solution to (9) as 




(Ey,E^ 


(k)y 


(k) 


:E)^ 




E 


(k) 


^ l-(Ejt,Ei' 




E 


;re (•, •) is the usual inner product of two vectors. 

/e can prove the following result about the above procedure. 


(9) 


sorem 5. The solution t!^ of (8) is 


of This is readily seen since by definition, 


Bmi+l 


P'(wi4-1) 

^wi+l 


- E;7^j+1 


stituting the identities B;„j+i = I - EE^ and E^i+i = z^', we obtain the theorem. 

Tie above inductive procedure (on k) suggests the following algorithm in a vector and 
stributed/parallel environment. 


? i. 1. Communicate Ey to processor y = l,...,mi + l. 
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2. At processor j, setE^^^ = %• 

3. Set k = 1. 


Step 2. From processor k, communicate E^ and (E^,E^'‘'’) to each processor j 

k \ 1ifii \. 

Step 3. At processor j,j = k-\-l,...,m\-\-l compute 




E) 


(fc+1) 


+ 


(E^,E 


(k). 


i-(E^.,En 


T?(k) 
(k)y '^k 


Step 4. Set k = k-\-l.lfk <mi, then go to step 2, otherwise declare e[^^^ as the solution 
4^ of (8). 

A careful count of the number of multiplications needed are summarized in the following 
theorem. 


Theorem 6 . The number of multiplications required by the algorithm to solve ( 8 ) is 
m\m 2 + 2m 1 ^ 2 . In the vector and parallel environment, the number of vector operations 
required is 3m i. 

Proof Can be readily verified by a careful counting. 

There is a considerable advantage in keeping mi < m 2 while solving (8), an m 2 x m 2 
system, by the above strategy. Otherwise, it would require m im^ multiplications to obtain 
EE^, ^(m 2 + 3m2 — 9m2) to obtain the Cholesky factor of I — EE^ and m 2 (m 2 + 1) to 
solve the system using the Cholesky factor. The method of this section is less advantageous 
if m 2 < mi; and to use it effectively mi and m 2 should be re-defined. 

4 . Transportation and assignment problem 

The transportation problem is the following: given m supply depots, with Si (> 0) as 
the units of supply of some good at depot i for each i = 1,..., m; n demand centres 
with dj (> 0) as the units of demand of the good at the centre j for each j = 1, ..,, n; 

Cij (> 0) the cost of shipping one unit of good between depot i and centre j for 
each i = 1,..., m, j = 1,,.., n; find the least cost quantity of good shipped between 
each depot and centre. We assume here that the transportation problem is balanced, or 
Zl/=i = Yl]=i dj\ i.e., there are just enough goods available at the supply depots to 
meet this demand. In this case it is well known that the transportation problem has an 
optimal shipping schedule, and it can be found by solving the following Transportation 
Linear Program, 

m n 
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m 

/=1 

are xij is the number of units of goods shipped from depot i to centre 7 , for each i 

^ j- 

rhe assignment problem is the following: given there are n individuals that can do 
' of the n tasks; and, assigning the individual i to the task 7 costs ^ij(> 0 ) for each 
■■ 1,..., n, j = I,... ,n, find the least cost assignment of individuals to tasks. It is also 
uired that each individual perform exactly one task, and that each task be performed 
exactly one individual. This problem can be cast as a balanced transportation problem, 
h each individual associated with a supply depot with exactly one unit of supply, and 
h task with a demand centre with exactly one unit of demand. The only difference is 
t the variables Xij are required to take on values 0 or 1. As is well known, because of 
total unimodularity property of the constraint matrix, setting Si ~ 1 and dj = 1 for 
•hi = I, ... ,n and j = 1,..., n and solving the transportation linear program suffices 
ind the optimal assignment, 
fhe dual of this linear program is the following: 

m n 

max Y^SiUi + ^ dj vj 
/=1 ;=1 

Uivj h Sij =Cij, forallz = 1 , ...m, j = \,...,n, 

Sij>0, for all / = 1 ,.. .m, j = l,...,n, 

ere m, and vj are the dual variables and Sij are the dual slacks for each i and 7 . 

Fhe constraints of the (primal) linear program fall naturally into two sets, the supply 
istraints and the demand constraints. It is this partition that we will exploit in the 
;orithm. As is well known, the rank of the constraint matrix is m + n — 1, which is one 
s than the number of constraints. Thus one constraint must be discarded to ensure that 
! constraint matrix has full row rank. We will discard the supply constraint associated 
th the depot m, and use the following constraint matrix: 

"e^ 0 0 0 

0 • • 0 0 

A = 

0 0 • • 0 
I III 

lere = (1,1,..., 1), an n vector, and I is the n x /i identity matrix. 

Thus we define. 



r T 

0 

0 

0 " 


0 

T 

0 

0 

G = 




H = (I,I,. .,1), 


0 

0 

T 

• • e'' 

0 



where G is an (m — 1) x mn and H is an « x matrix. The diagonal matrix D has mn 
entries along the diagonal, and has the partition 


0 0 
0 0 


0 0 


where is on nxn diagonal matrix, and its jth diagonal entry is the (/th diagonal 
entry of D, which corresponds to the entry of D associated with the variable Xij. The exact 
form of this entry depends on the particular interior point algorithm implemented. 

It can now be readily confirmed that 


GDG^ 


-eT’Ddle 

e^D(2)e 




an(m — l)x(m — 1 ) diagonal matrix; and 


HDH^ = 

j=i 

mn X n diagonal matrix. Thus Li andL 3 are diagonal matrices, with the jth diagonal 
entry of Li andLs being and (YT-i »respectively. Also 


GDH^ 


■ ■ 

e^D("‘-i) 


and 


L 2 = (D<^^e, ..., D('"~^)e)L7^ 

Thus 

E = L^^L2 

which is an n X (m — 1 ) positive matrix. 


Theorem 7 . Each iteration of the interior point method requires nm? +6nm4-2(m—« + 1 ) 
multiplications for the partitioned assignment problem. 


Proof Using the stmcture of Li, L2 and L3; and solving ( 8 ) by the method of § 3, and 
carefully counting the multiplications at each step, we get the theorem. 
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Generalized upper bounding problem 


generalized upper bounding 

linear program is 

the following: 



minimize cqxo 

+ 

T 

c{ xi 

+ 

T 

C^X2 

+ ••• 

+ CpXp 


AqXo 

+ 

Aixi 

+ 

A 2 X 2 

+ ••• 

kpXp 

= b 



T 

e;x2 





= 1 





ejx2 



= 1 








= 1 

xo > 0 


xi > 0 


X2 > 0 


Xp >0 



;re, for each j = 0 ,..., kj is an m x nj matrix, Cj and xy are nj vectors; and for 
: 1,ey a is an nj vector of all ones. Transportation and assignment problems 
e this structure as well, but in important applications, m « p (i.e., m is much less 
1 p). We make the usual assumption that the constraint matrix has the full row rank 
^ P- 

is evident, the constraints of this problem have the natural partition (1) with 


G — (Aq, Ai,..., kp) 


"0 ef 0 ••• 0 “ 

00 el •• 0 

H = 

0 0 0 • • ej 

nd the diagonal matrix D has the partition (corresponding to the partiton of A) 
■d( 0 ) 

D(i) 


d(p) 


( 10 ) 


vhere is an nj x nj diagonal matrix corresponding to colunms of Xy in A, for each 

= I,.. p. 

lere 


GDG^ = ^AyD(-^'^Aj, 


y=o 

e[D(i)ei 


HDH^ 


e?’D(2)e2 


ejD(P)ep_ 
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with HDH^ a diagonal matrix. Thus L 3 is a diagonal matrix, and Li, the Cholesky factor 
of the m X m matrix 

f:AjDWAj =LiL[. 

j=0 

In applications, even when Aj are sparse, we would expect their sum to be considerably 
more dense, and thus we expect Li to be relatively dense. Also 

GDH^ = (AiD^i^ei, ..., A^D^^^ep). 

ThusifL2 = (li,l2,...,lp), 

and we can expect the p x m matrix L 2 to be dense. In addition, E = LJ^L 2 - 

Theorem 8 . Each step of an interior point method requires ^pm^ + 4- ^pm + 

+ 2p when applied to the partitioned GUB problem. 

Proof Can be obtained by a careful calculation of the work at each step of the algorithm. 
The multiplications required to obtain the Cholesky factor Li are included in the above 
formula. 


6. Block diagonal linear program 


The block diagonal linear program is the following; 

minimize CqXq 
AqXq 


c[xi + 

cjxz 

+ 

•• + 



Aixi + 

A2X2 


•• + 

ApXp 

= bo 

A1X2 





= bi 


A2X2 




= b2 





ApXp 


xi > 0 

X2 > 0 



Xp > 0 



Xo > 0 

where, for each j = 0,..., p, Aj is an mo x nj matrix, cj and xj are nj vectors; and, for 

each y = 1,..., p. A; is an mj x nj matrix. Let m = m^ and M = mi H-h m^. The 

constraints of this problem have the natural partition ( 1 ) with 


G = (Ao, Ai,..., Ap) 

’0 Ai 0 ...o' 

00 A 2 • 0 

H = 

0 0 0 - Ap 

and the diagonal matrix D has the partition (10). It can be verified that 
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p 


GDG?' = X:AP<-^)aJ; 

J=0 


[AiD^DAf 


HDH^ = 


A2D(2)A[ 


L A;,d(/^)a;j 

(lock diagonal matrix. As in the GUB case, the factor Li of GDG^ will be relatively 
e. L 3 , the Cholesky factor of HDH^ is block diagonal, and each diagonal factor can 
)mputed independently. Thus, we can assume that 


rui 



L L 3 pJ 

hen Aj is sparse, we can preserve the sparsity of Lsy (the Cholesky factor of AjD^^'^Aj ) 
le techniques of sparse Cholesky factorization, George & Liu (1981). Also 

GDH^ = (AiD<‘>A[, ..., ApO'^-’A^) 

m x M matrix. If Lj = (L 21 ,.. -, 

L,Lf^=AyDO)Aj. 

jxpect L 2 ; to be, generally, dense. 

;t E = (E^^\ ..., which conforms to the partition of L3, Then, for each 

1,2, ...,p 

L3jE«=L22- 

we can readily establish the following theorem: 


orem 9. For a dense problem, it takes g + 3m? — 9mj) + ^Mm(m + 

- y Ylj=i^jOnj + 1) multiplications to generate Li, L 2 , L 3 and E; and, takes 
^ + 4Mm + 4 - m + 2Zf=i w? + M multiplications to solve the systems of the- 

t (4). If mj = m for all j = 1, , p, the above forms reduce to ^(m^ 4- 3m^ — 
f pm^ -{-3pm?' — 9pm) 4 - \pmm{m 4 - 1) 4- ^pmm{m 4- 1) and pm^m -^4pmm A 
f- m 4- pm? -f pm respectively. Thus, in this case, the computations grow linearly 


yf. This can be readily proved by a careful counting of the required multiplications. 


arc incidence matrix of the directed network through which these commodities flow. 
Thus, 

GDG^ = 

;=o 

is diagonal, and the sparsity pattern of , for each i = 1 ,is the same. 

Also, 

Thus 

L 3 iE«>=RD<') 

Lsi is the Cholesky factor of , and, for each z, has the same sparsity pattern as that 

of the factor of RR^. It is readily confirmed that, in the notation of George & Liu (1981), 
the graph associated with RR^ is the unordered network on which the commodities flow. 
This facilitates considerably the use of their techniques for generating sparse Cholesky 
factors Lsi . Since each column of R has only two nonzero entries, will be sparse; and 
will have the same sparsity pattern for each i. 

For a network with m nodes, n arcs and p commodities, Li is an zr x n diagonal matrix, 
for each i — I,..., p, Ls,- is an m x m matrix, £ 2 / is an m x n and E is pm x n. Also, 
assume that it takes 8 multiplications to get a sparse Cholesky factor of RR^; and note 
that 8 < + 3m^ — 9m) \ and that the number of non-zero elements in this factor are 

T). Then, Ae following can be shown. 

Theorem 10. Given Li, L 2 , L 3 and E, it takes pmn^ + 4pn 2n prj multiplications 
to solve all the systems of theorem 3. 

7. Computational experience 

In this section, we present some computational experience of solving assignment problems 
by the procedure suggested here and by the state-of -the-art code LOQO (Vanderbei 1992). 

LOQO is a state-of-the-art interior point code based on the primal-dual homotopy 
method, and implements a predictor-corrector strategy for tracing the path of centres . 
The per iteration times of this code are compared with the per iteration times of a special¬ 
ized transportation code, implementing the dual affine scaling strategy. The LOQO per 
iteration times may be a little larger because of the step size selection in the predictor- 
corrector strategy. The per-iteration times and their ratio are given in table 1. We point out 




-D^-) 
=1 


( 11 ) 



lem Size 


Assignment 


LOQO 

Ratio 

Loqo/Asgn 

Iter 

Time* 

Iter time 

Iter 

Time 

Iter time 

200 x 200 

14 

20.78 

1.48 

15 

75.72 

5.048 

3.41 

200x200 

17 

25.09 

1.476 

17 

85.30 

4.911 

3.33 

200 x 200 

21 

31.02 

1.477 

20 

96.61 

4.830 

3.27 

300x300 

14 

65.72 

4.69 

18 

301.07 

16.72 

3.565 

300x300 

19 

88.80 

4.67 

19 

314.16 

16.53 

3.540 

300x300 

21 

98.06 

4.67 

21 

418.5 

16.86 

3.61 


is time is the total time for all the iterations, without the input/output time. All times are in seconds. 


for these special problems, the specialized version was about 3 to 4 times faster than 
general purpose code, LOQO. 

he three problems of size 200 x 200; and, of size 300 x 300 are generated randomly, 
present increasing difficulty to interior point methods. On the first problem, these 
hods converge to a solution in the interior of a face, with very few variables at value 1. 
the second problem, these methods converge to a solution with more than 75% of the 
ables at value 1, while on the third problem they converge to a vertex, with all variables 
due 1. For the first problem, LOQO found a solution to the accuracy of eight significant 
res, but for the second and third, it was unable to find this accurate a solution. For these 
Diems it found a solution to 7 digits of accuracy. 
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Abstract. The problem of obtaining long sequences with finite alphabet and 
peaky aperiodic auto-correlation is important in the context of radar, sonar and 
system identification and is called the coded waveform design problem, or sim¬ 
ply the signal design problem in this limited context. It is good to remember 
that there are other signal design problems in coding theory and digital com¬ 
munication. It is viewed as a problem of optimization. An algorithm based on 
two operational ideas is developed. From the earlier experience of using the 
eugenic algorithm for the problem of waveform design, it was realised that 
rather than random but multiple mutations, all the first-order mutations should 
be examined to pick up the best one. This is called Hamming scan, which has 
the advantage of being locally complete, rather than random. The conventional 
genetic algorithm for non-local optimization leaves out the anabolic role of 
chemistry of allowing quick growth of complexity. Here, the Hamming scan is 
made to operate on the Kronecker or Chinese product of two sequences with 
best-known discrimination values, so that one can go to large lengths and yet 
get good results in affordable time. The details of the ternary pulse compres¬ 
sion sequences obtained are given. They suggest the superiority of the ternary 
sequences. 

Keywords. Coded wavefoim design; global optimization; bi-parental prod¬ 
ucts; Hamming scan. 


Introduction 

term signal design has different meanings in coding theory, spread-spectrum com- 
lication, and radar. Here it is short for a coded waveform design problem in radar. 
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This involves obtaining sequences of finite lengths with small discrete alphabet, which are 
good approximations to white noise (Golay 1977). In particular, the alphabet (—1,0, 1) is 
considered. The sequences using them are called ternary sequences. Much of the earlier 
work deals with binary sequences with (—1,1) as the alphabet. The goodness of the degree 
of approximation is measured here in terms of the discrimination. An alternative measure 
is the merit factor (Golay 1977). 

To set the notation and to concretize the ideas, let 


S — [‘^0» 15 • • • 1 —2 5 — 1 ] 


( 1 ) 


be the sequence of length n, where the elements Si are from any of the two alphabets 
referred to above. Then 

n—\—k 

p(^)= X] /: = 0,1,...,«-1, _ (2) 

1=0 

is called the aperiodic auto-correlation of the sequence s. The quantity. 


D = p(0) 


_ K^O 


(3) 


is called the discrimination. 

The problem of obtaining sequences with high discrimination is a very difficult one. 

It is so because whereas for the binary sequences with good periodic auto-correlation, 
analytical design procedures or constructions (Baumert 1971; Golay 1983; Hoholdt et al 
1985; Reddy & Rao 1986; Jensen et al 1991) based on powerful number-theoretical results 
are available, for sequences with good aperiodic autocorrelations it has still to be basically 
a search. Further, the search cannot be exhaustive, because the number of sequences to 
be searched for grows combinatorially, as the length increases. Here, the emphasis is 
on obtaining as good sequences as possible with certain reasonably efficient procedures 
without claiming that even higher discrimination values are not possible. Golay’s notion 
of sieves (Golay 1977) which restricts the search to a subset of sequences which can either 
be designed or searched efficiently and has properties which are desirable for enhancing 
the peakiness is a similar one. The sieves promise some good sequences without claiming 
that all good or even the best sequences are necessarily retained. 

There is extensive work on binary sequences (Barker 1953; Turyn 1963,1968; Boehmer 
1967; Baumert 1971; Golay 1972, 1977, 1982, 1983; Moharir 1975; Beenker et al 1985; 
Hoholdt et al 1985; Kerdock et al 1986; Reddy & Rao 1986; Bemasconi 1987, 1988; 
Hoholdt & Justesen 1988; Golay & Harris 1990; Newmann & Byrnes 1990; Jensen et cil 
1991; De Groot et al 1992), which however we do not attempt to review here. Beyond 
the Barker sequences (Barker 1953) of length 11 and 13, having the discrimination values 
of 11 and 13, the four best binary sequences having discrimination values of 14, 15, 16 
and 17 (Kerdock et al 1986) have been obtained at the smallest lengths of 28, 51, 69 and 
88 . Complete enumeration is possible only up to small lengths (Kerdock et al 1986) and 
would not be regarded as too satisfactory, the ultimate aim being construction rather than 
search. Partial searches using the notion of sieves (Golay 1977) were extended to higher 
lengths. Starting with binarv senuences with ideal neriodie antoeorrelatinn and RP.arr.H-it-t o 
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ve been obtained. To cite some examples (Reddy & Rao 1986), discriminations of 22.82, 
.85, 36.86, 41.45, 41.77 and 42.46 have been obtained at the lengths of 251, 349, 811, 
9, 919 and 1019, respectively. 

Optimization techniques have been used for the signal design problem (Bemasconi 
87, 1988; De Groot et al 1992), but the design criterion has been different, viz., merit 
:tor. 

Temai 7 aperiodic sequences have been proposed and listed earlier with merit factor as 
5 criterion of goodness (Moharir 1974, 1976; Moharir et al 1985). They were obtained 
mg sieves such as terminal admissibility (Moharir 1976), skew-symmetry (Golay 1977), 
minal admissibility for skew-symmetric sequences (Moharir etal 1985) etc. This earlier 
)rk establishing the superiority of ternary pulse compression sequences over binary 
ijuences, if merit factor is accepted as a valid desideratum, is extended further in this 
per with discrimination as a chosen criterion. 

The algorithm and associated concepts 

I Hamming scan 

1 improved version of the genetic algorithm (Holland 1992; Michalewicz 1992) called 
genic algorithm was used (Singh et al 1996) to see whether better and longer temar}' 
quences could be obtained. Success was met with along both these directions. Yet it was 
•t possible to go to very large lengths, as the search time requirement increased very fast, 
was regarded that one should devise an optimization algorithm which is more efficient, 
en though possibly more suboptimal. The Hamming scan is one such algorithm. Genetic 
^orithms use random but possibly multiple mutations. Mutation is a term metaphorically 
ed for a change in an element in the sequence. Thus, a single mutation is at a Hamming 
stance of one from the original sequence. The Hamming scan looks at all the Hamming 
1 neighbours and picks up the one with the largest discrimination. If it is better than 
e original sequence with the chosen definition of goodness, the algorithm is recursively 
ntinued therefrom, as long as improvement is possible. Thus, an entirely probabilistic 
echanism of mutation is replaced by a locally complete search. The Hamming scan is 
pedited, and hence, made applicable at large lengths, by not calculating the aperiodic 
focorrelation of a Hamming neighbour ab initio, recognizing the fact that as only one 
3 ment is different, only its different contributions need to be taken into account. Let the 
sment sj be changed to Cj. As a result, let p (k) change to p'{k). Then it can be shown that 

/(0) = p(0) + (c?-^J), (4) 

id 


p'(k)=p(k) + (cy - Sj)Sj+k + - sy), k = 1,2,.., ,n - V, 

y = 0,l,.(5) 

In (5), there are two correction terms. They have to be implemented with care. One 
ay is to assume that Sp is equal to zero if p is outside (0,1,..., n — 1). Alternatively, 
e correction term sj-idcj — sj) is included only for k = 1,2,... ,n — j — I and the 
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correction term sj-kicj — sj) is included only fork = j + 1, j + 2,.;., n — 1. This idea ' 
is certainly trivial, but it has led to significantly increased efficiency, and hence, to search 
at longer lengths than would otherwise have been possible. 

For the ternary sequences the actual implementation can be somewhat different The ^ 
elements in the sequence are —1, 0 and 1. Each one of them can mutate in two possible 
ways. The mutations —1 —^ 0, 0 -> —1 and 1 —^ —1 are considered to give the lower 
strand of Hamming neighbours and the other mutations — 1 -> 1 , 0 - 4 - 1 and 1 -4- 0 are 
regarded as giving the upper strand of Hamming neighbours. The best neighbours along 
these two strands were found separately. The idea is that if one strand gives improvement, j 
the other strand may not even be considered in order to save time. As the results are certainly j 

path-dependent and optimality is not guaranteed, efficiency is a valid determinant. ! 

The Hamming scan yielded some better ternary sequences in reasonable time than 
were obtainable with the eugenic algorithm. However, the Hamming scan also became 
unaffordable at larger lengths. 

2.2 Simon’s principle through Kronecker and Chinese products 

Thatis when Simon’s principle (Koestler 1969; Simon 1981) suggested newer possibilities. 

It states that bigger systems evolve faster, when developed through the metastable inter¬ 
mediate subsystems, than if they are constructed ab initio from the smallest components. 

In the present context, it took the form of using two sequences of the best discrimination 
values available and obtaining a sequence of much larger length from them, such that it 
already had better discrimination than would result from random choice. The actual mech¬ 
anism is provided by bi-parental products (Moharir 1992) of two or more sequences. In 
particular, two products, viz., Kronecker product (Brewere 1978; Moharir 1992) and the 
Chinese product (Moharir 1977,1992) are chosen. These products are said to be bi-parental 
because each element in the product depends exactly on one element each from the two 
component sequences. 

The Kronecker product of two sequences 

Si = [501,511, ...,5(p_l)l], S2 = [502, 512, . . . , 5(^-1)2], (6) 

of lengths p and q respectively, is a sequence s of length pq, defined as 

Sk = (501S2, 511S2 , . . . , 5(p_i)iS2). (7) 

The Kronecker product is not commutative. 

The Chinese product of the two sequences of ( 6 ) is defined only when p and q are 
relatively prime, that is, when they do not have any common prime factor. The Chinese 
product is defined as 

‘5‘Ck = 5if52;, (8) 

where 

, f i mod p 

~ [ymod^ 

and has the length pq. It is called the Chinese product because the solution of the con- 
gmence relation (9) is what the Chinese remainder theorem deals with. It is commutative. 
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nputationally, the Chinese product of the two sequences can be obtained easily by re- 
tmg the sequences Si and Sj of relatively prime lengths p and q respectively, q and p 
es and then taking an element-wise (Schur) product (Moharir 1992). 

:'he autocorrelation of sr can be expressed in terms of the autocorrelations of Si and s, 
bllows (Turyn 1968). 

Borem 1. 

Pski<jkl -f k2) = Psx(k\)Ps2(k2) + Psi(k\ -b \)Ps^{q - k2), 

/:i =0, 1, ...,p - 1; A:2 = 0, 1, - 1. (iQ) 

^he theorem shows that if the individual autocorrelations are good, so is the resultant 
ocorrelation, except at some lag values. It can further be shown that the discrimination 
'he Kronecker product of two sequences depends only on the discriminations of the 
^iponent sequences. The attenuated minimum guarantee theorem below is important. 

eorem 2. If the discriminations of the two sequences Si and S 2 are D\, and D 2 respec- 
dy, with min(Di, D 2 ) = f^min max(Di, D 2 ) = i9max. ^hen the discrimination Dk 
7oth their Kronecker products Si x §2 and Si x Si is bounded as 

^min ^ ^ — (-Dmax/(1 4- f^max))) (11) 

ere a may be called the attenuator. 

fhe following has been shown by Moharir (1977). 

eorem 3. The periodic autocorrelation of the Chinese product of two sequences is a 
inese product of their periodic autocorrelations. 

\ sequence obtained by a bi-parental product (Moharir 1992) of two or more sequences 
i later be further improved by a Hamming scan. The time requirement comes down 
isiderably because the algorithm begins with a good starting point. The importance of 
ilocal optimizers such as the genetic algorithm is that the end result depends on the 
rting point only weakly, and ideally, not at all. But this statement is concerned with 
y the ultimate reachability of the algorithm, and the utility of a good starting point in 
ermining the time requirements should not be underestimated. This is particularly so 
en good starting points can be designed or devised. Second, global optimization also is a 
il and not a reality. In any case, the Hamming scan is not a global optimization algorithm, 
erefore, if a large starting discrimination can be obtained by a simple procedure, in 
iition to the time saved, many lower local optima of which the Hamming scan possibly 
lid not come out would also be avoided. 

Fhe anabolic bi-parental product scheme really works well as can be seen from table 1. 
inalyses the role of the Kronecker and the Chinese products in the proposed algorithm, 
me ternary sequences with very good discrimination were chosen. They were obtained 
m the ternary sequences with the best merit factors listed by Golay (1977) by recursive 

rv>- 1 5 vvn 1 0 + 1 O 1'Vlf^lT’ TCt'OTlP.P.lCftT*/5lTCl C^hlTlGS0 OrOdliCtS 
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Table 1. Analysis of the efficacy of the bi-parental products, viz. Kronecker and Chinese, in designing 
ternary sequences with large discrimination. The lengths of the component sequences are n \ and n 2 . Their 
discriminations are Di and Do - The discrimination of the product sequence is D. The bi-parental product 
efficiency is ?? and the exponent is y. Whether the bi-parental product P is Kronecker or Chinese is 
indicated by K and C respectively. 


n\ 

in 

n 

Dx 

Di 

P 

D 

n 

Y 

31 

31 

961 

13.0000 

13.0000 

K 

12.7547 

0.9811 

0.4963 

9 

33 

297 

7.0000 

12.5000 

K 

7.0000 

0.7483 

0.4352 

33 

33 

1089 

12.5000 

12.5000 

K 

12.5000 

1.0000 

0.5000 

24 

111 

2664 

20.0000 

20.0000 

K 

19.7531 

0.9877 

0.4979 

24 

147 

3528 

20.0000 

20.0000 

K 

19.6721 

0.9836 

0.4972 

24 

159 

3816 

20.0000 

21.8333 

K 

20.0000 

0.9571 

0.4928 

11 

13 

143 

11.0000 

13.0000 

K 

11.0000 

0.9199 

0.4832 






C 

8.9375 

0.7474 

0.4413 

13 

15 

195 

13.0000 

13.0000 

K 

13.0000 

1.0000 

0.5000 






C 

12.0714 

0.9286 

0.4856 

11 

17 

187 

11.0000 

11.0000 

K 

11.0000 

1.0000 

0.5000 






C 

6.7222 

0.6111 

0.3973 

11 

23 

253 

11.0000 

19.0000 

K 

11.0000 

0.7609 

0.4488 






C 

7.7407 

0.5354 

0.3831 

19 

31 

589 

13.0000 

13.0000 

K 

13.0000 

1.0000 

0.5000 






C 

12.0714 

0.9286 

0.4856 

13 

33 

429 

13.0000 

12.5000 

K 

12.5000 

0.9806 

0.4961 






C 

10.1563 

0.7967 

0.4554 

29 

33 

957 

8.6667 

12.5000 

K 

8.6667 

0.8327 

0.4609 






C 

8.5526 

1.2141 

0.5381 

24 

145 

3480 

20.0000 

23.2000 

K 

20.0000 

0.9285 

0.4879 






C 

11.2621 

0.5228 

0.3944 

24 

155 

3720 

20.0000 

22.1667 

K 

20.0000 

0.9499 

0.4916 






C 

19.4161 

0.9221 

0.4867 

15 

17 

255 

13.0000 

11.0000 

K 

11.0000 

0.9199 

0.4832 






C 

11.0000 

0.9199 

0.4832 

11 

25 

275 

11.0000 

10.5000 

K 

10.5000 

0.9770 

0.4951 






C 

10.5000 

0.9770 

0.4951 

13 

31 

403 

13.0000 

13.0000 

K 

13.0000 

1.0000 

0.5000 






C 

13.0000 

1.0000 

0.5000 

9 

11 

99 

7.0000 

11.0000 

K 

7.0000 

0.7977 

0.4480 






C 

8.5556 

0.9750 

0.4942 

9 

23 

207 

7.0000 

19.0000 

K 

7.0000 

0.6070 

0.3979 






C 

8.3125 

0.7208 

0.4330 


{Continued onfacing page') 


of these two products are listed. It can be seen that impressive starting points for the 
Hamming scan are available. The purpose of table 1 is, however, different. It compares the 
two products as the bases of the bi-parental product signal design. In table 1, six situations 
can be identified. In view of theorem 2, there is not much uncertainty about what the 
Kronecker product can achieve. For some entries in table 1, the component lengths are 
such that only the Kronecker product is defined and the Chinese product is not defined 
as the two lengths have a common factor. For these sets of component lengths, only the 
Kronecker product can be used. For the next set of entries, Dq is less than Dk- For the next 
four sets of entries, Dq equals Dk, lies between Dk and Dmax, equals Dmax? and exceeds 
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le 1. (Continued) 


ni 

n 


Di 

P 

D 

V 

Y 

29 

261 

7.0000 

8.6667 

K 

7.0000 

0.8987 - 

0.4740 





C 

8.2727 

1.0621 

0.5147 

29 

377 

13.0000 

8.6667 

K 

8.6667 

0.8165 

0.4571 





C 

11.2667 

1.0614 

0.5126 

29 

435 

13.0000 

8.6667 

K 

8.6667 

0.8165 

0.4571 





C 

11.6552 

1.0980 

0.5198 

29 

725 

10.5000 

8.6667 

K 

8.6667 

0.9085 

0.4787 





C 

9.1000 

0.9539 

0.4895 

25 

425 

11.0000 

10.5000 

K 

10.5000 

0.9770 

0.4951 





C 

11.0000 

1.0235 

0.5049 

25 

525 

8.5000 

10.5000 

K 

8.5000 

0.8997 

0.4765 





C 

10.5000 

1.1114 

0.5235 

31 

651 

8.5000 

13.0000 

K 

8.5000 

0.8086 

0.4548 





C 

13.0000 

1.2367 

0.5452 

21 

357 

11.0000 

8.5000 

K 

8.5000 

0.8790 

0.4716 





C 

13.3571 

1.3814 

0.5712 

21 

399 

13.0000 

8.5000 

K 

8.5000 

0.8086 

0.4548 





C 

13.8125 

1.3140 

0.5580 

27 

459 

11.0000 

8.3333 

K 

8.3333 

0.8704 

0.4693 





C 

11.4583 

1.1968 

0.5398 

29 

493 

11.0000 

8.6667 

K 

8.6667 

0.8876 

0.4738 





C 

13.0000 

1.3314 

0.5628 

29 

551 

13.0000 

8.6667 

K 

8.6667 

0.8165 

0.4571 





C 

14.0833 

1.3268 

0.5599 

29 

609 

8.5000 

8.6667 

K 

8.5000 

0.9903 

0.4977 





C 

9.2083 

1.0729 

0.5164 

29 

783 

8.3333 

8.6667 

K 

8.3333 

0.9806 

0.4954 





C 

9.4203 

1.1085 

0.5241 

31 

465 

13.0000 

13.0000 

K 

13.0000 

1.0000 

0.5000 





C 

13.5200 

1.0400 

0.5076 

33 

825 

10.5000 

12.5000 

K 

10.5000 

0.9165 

0.4821 





C 

13.1250 

1.1456 

0.5279 

33 

1023 

12.5000 

12.5000 

K 

12.5000 

0.9806 

0.4961 





C 

15.4762 

1.2141 

0.5381 


lax- Thus, the Chinese product has the possibilities of providing better starting points 
n those provided by the Kronecker product, for the bi-parental product algorithm for 
nal design. 

f is useful to define some quantitative measures for the bi-parental products in the 
itext of the objective function chosen. The Kronecker efficiency t^k (for discrimination) 
lefined as 

nK = DK/(DiD2)''^. ( 12 ) 

s Kronecker exponent is defined by 

Dk = (DiD 2 V’^ or XK = logDK/log(i9iZ)2). (13) 

fable 1 shows values of t^k and xk • It can be seen that they lie in narrower ranges than is 
jgested by theorem 2. A starting sequence of a particular large length can be obtained by 
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factoring the length differently. In such cases two thumb rules could be used for guidance, 
the Chinese efficiency 77 c and the Chinese exponent yc can be defined by replacing Dk by 
Dc in (10) and (11), where Dq is the discrimination obtained by using a Chinese product. 
Table 1 also shows ?7 k and yk- The ranges of these are larger than those of tjk. and yk- The 
extension of the ranges on the higher sides must be viewed as advantageous. 


2.3 Hamming scan and Simon’s principle 

Our interest was in obtaining ternary sequences with good discrimination values. The 
Kronecker product is not commutative. But the discriminations of Si x S 2 and S 2 x Si 
are the same. The Chinese product may have a superior, equal or inferior discrimination. 
Subsequently, however, they may evolve differently under a Hamming scan, so that the 
initial advantage of a superior discrimination may not last. When starting sequences of a 
particular length can be obtained by bi-parental products using different factors, further 
progress under a Hamming scan is also different. These are indicators that a Hamming scan 
does not give the global optimum. Let the ratio of the discrimination eventually obtained by 
recursively using a Hamming scan to the discrimination of the starting sequence obtained by 
a bi-parental product be called the Hamming gain for that product. It is not as constant as the 
bi-parental product efficiency or the exponent. Let both the components in the Kronecker 
product be Barker sequence of length 11. Then the discrimination of the resultant sequence 
of length 121 cannot be improved by the Hamming scan, meaning that the Hanrming gain 
is just unity. But such situations are rather rare. Variability of the Hamming gain makes it 
difficult to use a good starting point as a very dependable criterion and offers no good thumb 
mles. One may make a choice on the basis of the average size of the improvement in the 
first few Hamming scans. But that is more a way of trying to minimize the computational 
effort than trying to estimate beforehand what discriminations may be achievable, as the 
improvement is not uniform over successive Hamming scans and the number of successful 
scans is also not known beforehand. It has been observed that the Hamming gain for 
the Chinese product is frequently less than that for the Kronecker product. It may also 
be noted that a sequence obtained by Hamming scan operating on a bi-parental product 
of two sequences, can later be used as a component in the same or different bi-parental 
product. All these possibilities have not been exhausted. 

However, one more idea has been found to be very useful. After taking a bi-pai'ental 
product of two good sequences, if the merit factor is improved by the recursive Hamming 
scan for some time and then the discrimination is taken to be the objective function to be 


Table 2. The number of lengths Nt at which ternary sequences exceeding the discrimination threshold 
of Dt have been obtained. 


Df 

Nf 

Dt 

Nj 

Dt 

Nt 

48 

2 

44 

7 

40 

27 

36 

47 

34 

75 

32 

119 

30 

170 

28 

242 

26 

304 
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'le 3. A list of ternary sequences, with discrimination values greater than 36, in the descending order 
le discrimination. 


gth 

Discrimi¬ 

nation 

Length 

Discrimi¬ 

nation 

Lengtli 

Discrimi¬ 

nation 

Length 

Discrimi¬ 

nation 

4 

48.9048 

924 

42.4667 

1584 

40.2414 

1001 

37.6087 

1 

48.2222 

649 

42.4615 


40.0000 

592 

37.5455 

3 

47.3125 

975 

41.7059 


40.0000 

555 

37.4000 

1 

44.6667 


41.3889 

584 

39.8000 

596 

37.2727 

9 

44.4444 

819 

41.1538 

mmm 

39.7667 

593 

36.9091 

2 

44.2400 

1188 

40.9474 


39.6000 

851 

36.8125 

1 

44.0588 

1305 

40.8889 


38.5714 

1121 

36.5789 

0 

43.5385 

726 

40.8571 

925 

38.5625 

1608 

36.5556 

0 

43.3214 

623 

40.7000 

625 

38.1818 

1617 

36.1389 

7 

43.1579 

858 

40.6471 

1071 

37.9524 

1053 

36.1200 

8 

43.0000 

1750 

40.6333 

1173 

37.7500 

767 

36.0000 

9 

42.5294 

792 

40.3846 

1464 

37.6429 




reased by the same procedure, superior discrimination values frequently result, than if 
discrimination was increased from the start. 


Results 

ly the sequence having the largest discrimination obtained at any length has been re- 
led. The numbers of lengths at which various discrimination thresholds have been 
'eeded are shown in table 2. Thus the discrimination thresholds of 32, 26 and 18 have 
;n exceeded at more than 100, 300 and 400 lengths respectively. The details about the 
[uences having the discrimination values of greater than 36 are shown in table 3. 

[t is seen that as the length increases it is easier to reach higher discrimination values, 
is is a simple consequence of theorem 2. To see this point, assume that the work of 
;aining ternary sequences with good discrimination values is conducted up to the length 
160. Then, the best sequences are of lengths 24, 111, 145, 147,155 and 159, and have 
criminations of 20, 20, 23, 20, 22.1667 and 21.8333 respectively. That is, they all have 
criminations exceeding 20. Then, theorem 2 implies that a Kronecker product of any two 
them must have a discrimination exceeding 19.05. In general, the actual discrimination 
lieved is frequently much better than aDminy as can be seen from table 1. Hamming 
in can then raise the discrimination even further. It is veiy rare that the Hamming gain 
ust 1. Thus, the highest discrimination obtained up to any length can almost always be 
needed at some higher lengths. 


Conclusion 

e bi-parental product algorithm has given ternary sequences with very good discrimi- 
ions. The success of the algorithm indicates that the locally complete search may be 
tferable to the Monte Carlo search which depends rather excessively on the efficacy of 
ince in the matter of optimization. Whereas chance is a useful ally while dealing with 
mbinatorially complex optimization problems, where feasible, it should be helped by 
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design procedures (Singh et al 1996) which can minimise the search effort. The anabolic | 
role of bi-parental products which permitted going to large lengths quickly is particularly s 
noteworthy in this regard. The algorithm has taken tlie problem of aperiodic signal design 
one step closer to that of the periodic signal design. The latter has two features. One of 
them is the availability of the regular construction procedures. That goal is still far away 
for the former problem. The second feature is that the discovery of a new good sequence 
means automatic construction of good sequences at many larger lengths. The bi-parental 
product procedure in the algorithm gives that feature to the aperiodic signal design problem 
also, though in a weaker sense, as the Hamming scan, which includes choice, is still to be 
performed. 

To the best of our knowledge, the ternary sequences obtained here are the best. Yet, 
the bi-parental product algorithm is not a global optimisation algorithm. Therefore, there 
could be procedures which can improve upon the sequences obtained here. 


The authors are grateful to Dr H K Gupta for encouragement. We are also thankful to 

Sri K Subba Rao and Dr K Sain for expert help in electronic processing of the manuscript. 
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Abstract. A medium resolution beamline, viz. photophysics beamline is be¬ 
ing built at INDUS-I, a 450 Mev synchrotron radiation source (SRS) at the 
Centre for Advanced Technology (CAT), Indore. To house the pre- and post- 
focusing toroidal mirrors and permit precision movements for steering the SRS 
beam up to the sample, two ultra-high vacuum (UHV) compatible min'or boxes 
and mirrors have been designed, fabricated and tested. Details of the setup, 
including UHV testing (P < 10“^ mbar) and performance evaluation of the 
mirror mounts under UHV conditions, are discussed in this paper. 

Keywords. Mirror box; mirror mount; photophysics beamline; residual gas 
spectrum; INDUS-1. 

Introduction 

US-I is a 450 Mev synchrotron radiation source (SRS) being constructed at the Centre 
Advanced Technology, Indore (Ramamurthi 1992). The radiation emitted from the 
ige ring is useful for performing experiments in the soft X-ray to infrared region. 
eamUne, viz. photophysics beamline dedicated to experiments on photophysics at 
US-I, is under fabrication. This beamline makes use of a toroidal mirror for focusing 
ight [40 mrad(//) x 6 mradfV)] from the tangent point of INDUS-I onto the entrance 
)f a one-metre Seya-Namioka type of monochromator and a second toroidal mirror for 
sing the monochromatic light emerging from the exit slit of the monochromator onto 
sample (1 mm x 1 mm spot size) positioned at a distance of one metre from the centre 
le focusing mirror. Detailed optical and mechanical layouts of the beamline formed 
)asis of the mirror box and mirror mount designs (Das & Rajasekhar 1992; Meenakshi 
i Rao et al 1992). A side view of the photophysics beamline is shown in figure 1. 
mate pressure in the beamline has to be maintained at <10“^ mbar*. In addition to 


5ar= 10^ Pa 
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SEYA-NAMIOKA 
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Figure 1. Mechanical layout of photophysics beamline. 
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5 the prime requirement for connecting the beamline to the storage ring which operates 
ressure of <10“^ mbar such ultra-high vacuum (UHV) environment also protects the 
ces of the optical components. Installation of the beamline to get maximum flux at the 
lie position requires precise and reproducible rotational and translational movements 
e mirrors about the X, 7 and Z axes under ultra-high vacuum conditions so as to 
the synchrotron radiation source (SRS) beam up to the sample. The fine movements 
red are achieved with the help of UHV compatible mirror mounts. Details of design, 
:ation and testing of the mirror mounts and miiTor chambers are described below. 


description 

Design considerations 

Mirror mounts: In case of mirror mounts for holding both pre- and post-mirrors, 
idjusting their alignment and fine movements, the considerations are 

otational and translational motions in three mutually perpendicular directions (X, Y 
id Z) should be independent. They should not disturb the prevailing UHV conditions 
F the beamline. 

ccuracies of translational movements should be of the order of 1 ± 0.5 mm. 
ccuracies of rotational movements should be of the order of 1 ± 0.2milliradian. 

the linear and angulai' positions are not maintained within the accuracies mentioned 
e, the image at sample position will be shifted from the mean position by about 0.3 mm 
ting in loss of flux of the order of 25%. 


Mirror boxes: The UHV mirror boxes needed several ports of various sizes not 
to connect them to the beamline but also to connect mirror mounts, gauge heads etc. 
[dition, ports also were provided for handling the mirror mounts and viewing the beam 
on the mirror. Table 1 gives a brief description of the ports of the mirror boxes. 


; 1. Description of the ports of the mirror boxes. 


T 


Quantity 


1 

5 




3 


:f 


Description 
Mirror mount (Port 1) 

Beam entry (Port 2) 

Beam exit (Port 3) 

Manual alignment (Ports 4 & 5) 
Sputter ion pump (Port 6) 

BA Gauge (Port 7) 

Roughing pump (Port 8) 
Pirani-Penning Gauge (Port 9) 

View Port (Port 10) 


1 
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Figure 2. Schematic of mirror box Mi . 
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3. Design details 

Based on the design considerations mentioned above and on the guidelines from the beam¬ 
lines of the Ultraviolet Synchrotron Orbital Radiation Facility (UVSOR), Japan (Sakai et 
al 1987), mirror chambers and mirror mounts were designed and fabricated. The two 
mirror boxes are mirror images of each other with a few minor differences depending 
on the entry and exit of light (Ramamurthi 1992; Meenakshi Raja Rao et al 1992, 1995). 
Schematics of the mirror boxes and mirror mounts are shown in figures 2 and 3 respectively. 
Figures 4 and 5 show photographs of the mirror mount and mirror chamber respectively. 
The mirror is enclosed in a frame provided with adjustable screws on the min'or mount. 
This arrangement facilitates the rotation of the mirror around an axis which is perpendic¬ 
ular to the plane of the mirror. The mirror frame is mounted on a copper block having 
an angle equal to the mirror tilt. The mirror frame has an arrangement for locating the 
mirror accurately by using fine pitch positioning screws. Two pivot joints provided on 
the frame impart the rotation necessary for angle correction. Rotation of the mirror is 
controlled from outside by a bellows-sealed mechanism using a micrometer head and 
a spring loaded plunger. The tube which imparts this rotation also acts as a carrier for 
coolant to the mirror frame. Multiplane adjustment of the mirror is provided by using 
three long bellows-sealed shafts located in a right-angled triangular geometry. The mirror 
mount is welded to a 203CF flange and this flange is connected to port 1 of the mirror 
box. 


4. Materials and fabrication 

SS 304L has been used for fabrication of most of the components of the mirror mounts 
and mirror boxes. The bellows and coolant channel used are made of SS 316. The sup¬ 
porting block of the mirror holder is made of copper for better thermal conductivity. The 
chambers and mounts are TIG-welded from inside, cleaned, electropolished, and degassed 
in a vacuum furnace at a temperature of 800°C for 8 h. This procedure reduced degassing 
thereby saving time taken for achieving ultimate pressure. 
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Testing setup 

mirror mounts were connected to the mirror chamber after degassing all the compo- 
s and the mirror box in a vacuum furnace. A 2701/s sputter ion pump was used to 
eve the ultimate pressure. A pirani gauge was used to measure pressure up to 10~^ mbar. 
ayard-Alpert nude ionisation gauge (model IG5M of M/S Edwards, UK) was used to 
sure pressure from 10““^ mbar to 10~^ mbar and a penning gauge and a leak detector 
j connected to the chamber with the help of a “T” through a turbomolecular pumping 
on (Model Turbopac 5150 of M/S Alcatel, France). The rest of the ports of the cham- 
were blanked off. Two sight glasses were provided for the entry and exit of collimated 
r light from He-Ne Laser used for testing the mirror mount mechanism. All gauges, 
control units, pumps and their accessories were provided power through the regu- 
1 power supply. Heating of the chamber, mount, ion pump etc. for degassing purposes 
carried out using different heating elements operated through temperature controllers, 
the vacuum sealings were done by metal gaskets made of OFHC copper. The experi- 
ital setup showing different instruments used for leak detection and evaluation is shown 
gures 6 and 7. 


1. S.I.P. 

2. MIRROR CHAMBER-MI 

3. BA GAUGE HEAD 

4. UHV VALVE 

5. T.C.GAUGE HEAD 

6. SORPTION PUMP 

7. DIAPHRAGM PUMP 

8. T.M.P.& ROTARY PUMPING STATION 

9. BA GAUGE 
10.S.I.P. POWER SUPPLY 

11. LEAK DETECTOR 

12. 'HEATERS 



Figure 6. Side view of the testing setup (drawing is not to scale). 
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1.B.A.GAUGE HEAD 
2;UHV ANGLE VALVE 
3JHERMOCOPLE GAUGE HEAD 

4. BELLOW 

5. LEAK DEFECTOR CELL 

6. PENNING GAUGE HEAD 
T.T.M.P^O ROTARY PUMP STATION 

8. PENNING GAUGE 

9. He-LEAK DETECTOR 

10.B.A. GAUGE 
11.S.1.P.POWER SUPPLY 


12. He-CYLINDER 

13. ACETONE CONTAINER 
14.S.I.P. 

15. DIAPHRAGM PUMP 

16. SORPTION PUMP 

17. MASS ANALYSER HEAD 
ia.R.F. HEAD 

19. RECORDER 

20. QMS CONTROLLER 

21. PERSONAL COMPUTOR 



Figure 7. Front view of the testing setup (drawing is not to scale). 


6. Testing procedure 

To start with, the mirror chamber was leak-tested for welding and gasket leaks of the 
order of 10“^® std.cc/s*. A pressure of 1.0 x 10“^ mbar was achieved with the help of a 
turbomolecular pump and a sputter ion pump without baking the chamber or the sputter 
ion pump (unconditioned). The residual gas spectrum was recorded using a quadrupole 
mass analyzer to ascertain the various components of residual gases and their relative 
concentrations as shown in figure 8. After baking the sputter ion pump and the chamber 
for about 8 h at 250°C and pumping the chamber with sputter ion pump, a pressure of 1,0 x 
10~^ mbar was achieved. The residual gas spectrum was recorded again at this pressure 
(figure 9). From figures 8 and 9 it can be seen that water vapour, CO 2 , N 2 and hydrocarbons 
which were the major constituents before degassing, have been suppressed and that H; 
is the only predominant gas component after baking. Thus hydrocarbon-free ultra-big] 
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PRESSURE INSIDE THE CHAMBER 4.5xio"®m bor Figure 8. Residual gas spectrum before 
PRESSURE AT THE ANALYSER HEAD 1-0 x io~®m.bar baking the miiTor chamber. 


um essential for SRS was achieved. After achieving this ultimate pressure, the mirror 
Its were tested for their performance under UHV conditions (i.e., reproducibility 
ninimum attainable translational and rotational displacement) using an He-Ne laser, 
test was performed by rotating the mirror about an axis which is perpendicular to 
ncident beam direction and measuring the image displacement on a screen placed 
iistance of 1250 mm (corresponding to the distance between pole of the first mirror 
mtrance slit of the monochromator). The minimum rotation of the micrometer screw 
rted a rotation of 0.0033° to the mirror which resulted in an image displacement of 
at the entrance slit. This is adequate as entrance slit width is planned to be of the order 
lO/x. For the second mirror mount similar procedure was repeated and same result of 
is obtained as the minimum displacement of the image and is considered to be very 
. for a spot size of 1 mm x 1 mm at the sample position. 


Conclusions 

mirror chambers and two mirror mounts for photophysics beamline were designed, 
cated and tested. The mirror mounts were tested for their performance at atmospheric 
:ure and at a pressure of 1.0 x 10~^mbar. The residual gas spectrum recorded at 
< 10“^mbar indicated that H 2 is the major constituent of the residual gases. The 
mum displacement of the image at the entrance slit (for mirror Mi) and at sample (for 
nirror M 2 ) is 75/x. This value is adequate as slit openings are 100/x for entrance as 
as exit slits and the sample size is 1 mm x 1 mm. 
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X 


residual gas spectrum 
after baking 







PRESSURE INSIDE THE CHAMBER i 0 k iO"*m oor Figure 9. Residual gas spectrum at ulti 
PRESSURE AT the analyser HEAD 3 5KiO“*m bof mate pressure. 
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Abstract. Discrete event simulation has been widely used to model and eval¬ 
uate computer and engineering systems and has been an on-going area of 
research and development. This paper presents an overview of the field. It 
covers specifications of discrete event systems, simulation methodology, sim¬ 
ulation languages, data structures for event management, and front and back¬ 
end support in simulation packages including random number generation and 
resource management. The emphasis of the survey is on simulation methodo¬ 
logy and event scheduling, which forms the core of any simulation package or 
environment. 

Keywords. Discrete event simulation; event scheduling; process interaction; 
priority queue; simulation languages. 


Introduction 

s design of a system or a process often needs to be evaluated for correctness and 
pneering properties before its implementation. Simulation is a cost-effective mechanism 
evaluate system and process design. Likewise, to study the behaviour of an existing 
fern or process, simulation is often more cost effective than direct system or process 
asurement. Depending on the underlying system model, the simulation will take the 
m of solution of a set of equations, as in the case of a continuous system model, or 
jcution of event-based program code, as in the case of a discrete-event system model, 
this paper we will consider only simulation of discrete-event systems. 

Ever since the sixties, discrete-event simulation has been widely used for modelling 
1 evaluating computer systems, computer networks, real-time systems, distributed sys- 
is, database management systems, manufacturing systems etc. For example, to evaluate 
^ configuration of a computer system for a banking application, to evaluate a resource 
inagement policy in an operating system, to study the behaviour of a local area network 
nmunication protocol, or to examine strategies for iob shoo scheduling. A number of 
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text-books and papers describe traditional as well as non-traditional uses of discrete event 
simulation, for example, Banks & Carson (1984), Jain (1991), Law & Kelton (1993), Banks 
& Norman (1996). 

Given that several engineering disciplines have found the need for discrete event sim¬ 
ulation, several languages and packages have been developed, both for general purpose 
use, as well as for a suitable set of applications. Advances in software development such 
as object-oriented design, data stmctures, and graphical user interfaces have caused ad¬ 
vances in simulation techniques and software. As a result, discrete event simulation is an 
active area of research and development. This paper presents a survey of state of the art 
in sequential discrete event simulation'. The focus will be on simulation methodologies, 
event scheduling, languages and modelling support in software packages. 

The rest of this report is organised as follows. Section 2 defines the terms and formalisms 
for discrete event systems. Section 3 surveys the different strategies that have been used 
for discrete event simulation. Section 4 presents a survey of simulation languages and 
packages. A core aspect of discrete event simulation, that is, future event scheduling is 
reviewed in § 5. Front-end and back-end support issues are discussed in § 6. Finally, § 7 
summarises the main aspects of this paper. 


2. Discrete event systems and their specification 

To model any system we first need to define its state space, i.e., the variables that govern 
the behaviour of the system with respect to the metrics being estimated. If the variables 
continuously change with time it is called a continuous system. If the system state instan¬ 
taneously changes at discrete points in time, instead of continuously, it is called a discrete 
system. 

In discrete systems whenever some state variables instantaneously change value, this 
occurrence is denoted by the term event. The general behaviour of a discrete event system 
is that the system starts out with some initial state. The system remains in that state for a 
duration of time. Then an event occurs which causes the system to instantaneously transit 
to a new state. This behaviour repeats with the system transiting from state to state over 
time but remaining in a specific state for a duration of time. This is opposed to continuous 
systems where the system continuously changes state with time. 

Let us consider a simple example of a discrete event system (DEVS), a single server 
queue that serves customers in first-come-first-serve order. It is assumed that the system 
is work-conserving, that is, the server will not be idle if there is a customer waiting for 
service and the server will not abmptly stop serving a customer. We model the state to be 
the number of customers in the system. The system starts out as empty. The first event 
to occur is a customer arrival and the state changes to one customer in the system. At 
this point the server begins service and the customer’s departure is scheduled. The next 
event to occur can either be the second customer’s arrival or the first customer’s departure, 
whichever occurs first in time. In the former case, the state will change to two customers 


Though distributed simulation is an active research topic, the majority of industry uses sequential discrete event 
simulation on account of the simplicity in description 


and in the latter case the state will return to the system being empty. The events can also be 
specified more elaborately, for example, start service and end service. In this case events 
occur instantaneously. A customer arrival at an empty queue will cause a start service to 
occur at the same time, without any change in state^. 

To model a DEVS we need to describe the state space of the system, the events in the 
system, the state transitions upon events and the times at which events will occur. The 
event space can be classified further as external input events, external output events and 
internal events. External input events are events that are triggered from outside the system, 
as in the case of a customer arrival at the queue. External output events are those that are 
generated as output from the system, as in the case of customer departure. Internal events 
are changes in state variables that do not affect the system environment, e.g., start service. 
The simulation of the discrete event system is done by means of generating events and 
executing the actions associated with the events. 

A DEVS can be precisely modelled using the abstract specification first presented by 
Zeigler (1976) (see also Evans 1988 and Fishwick 1993). This formalism has been widely 
used in the DEVS literature. Let X denote the set of external input event types to the discrete 
event system. Let O denote the set of external output event types. Let S denote the set of 
states of the system, where a subset of the state variables includes the list of future-event 
times at the given time instant. Then the DEVS abstraction is given by the 7-tupIe 

(X, O, <S, t)i 

where Sjnt : S 5 is the internal transition function dictating state transitions due to 
internal events. 

<5ext : Q X X -> <S is the external transition function dictating state transitions due to 
external input events. Q = {( 5 , e)|5' e 5, 0 < e < t(.v)} is the total state set of the model; 
(s, e) represents the state of having been in state .9 for elapsed time e. 

A.; S -> O is the output function generating external events, and 

T : S TZq is the time advance function. If the system is in state s at time t tlien 
the system will remain in that state until time / + T(.y). In other words, t( ) > 0 is the 
minimum of future event times. 

Composite models are constructed in Zeigler (1987) by coupling models (either atomic 
or composite) by means of external input coupling, external output coupling, and internal 
coupling. External input coupling specifies how input events of the composite model are 
identified with the input events of the components. External output coupling specifies 
how output events of the composite model ai’e identified with the output events of the 
components. Internal coupling specifies how the components inside the coupled model are 
interconnected by means of connections from output events of components to input events 
of others. 


^If the modeller so desires he can specify the state as the tuple (number of customers in queue, number of customers 
in server) so that even instantaneous events are associated with a change in state 
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3. Methodologies of discrete event simulation 

Section 2 defined discrete event systems. We now consider how to simulate discrete even 
systems, specifically, from a programmer’s viewpoint. We describe the most commoi 
strategies for designing and implementing discrete event simulation programs. (See Evan 
1988 for more details.) Three such methodologies are followed in the literature: even 
scheduling, activity scanning, and process interaction. 

3.1 Event scheduling 

In this strategy a list of all events in the system is first constmcted. Each event is take 
individually and described in terms of the particular interaction between entity (say cus 
tomer) and resource (say server). Associated with each event is the corresponding actio 
or procedure to be invoked when the event occurs. Consider for example a single serve 
queue: the events of interest are customer arrival, start service, end service, and leav 
system. The processing required at say end service is to compute system metrics, e.g 
response time, for the departing customer, clear its resource allocation (if needed), chec 
if there is another customer waiting in queue. If so, then schedule a start service event fc 
that customer. 

While the approach of event scheduling is straightforward, it involves programming i 
a low level. All events have to be enumerated in one place. Care must be taken to ensui 
that reactivations are scheduled, as in the case of the start service event being scheduled £ 
soon as the previous end-service is completed. Likewise, the start service event must als 
be scheduled if an arriving customer finds the queue empty. These reactivations typical! 
occur in zero time, but if they are not explicitly specified the simulation will terminati 
Thus, the responsibility of event scheduling lies entirely on the programmer, which is wh 
this strategy is so called. However, because of the explicit specification the efficiency c 
this strategy is best as compared to the other strategies. 

3.2 Activity scanning 

The purpose of this strategy is to overcome the reactivation problem of event schedulin; 
As before, the list of events is first drawn up. But now the events are classified into tw 
types: B-activities and C-activities. B-activities are activities that are Bound to occu 
whereas C-activities are those that are Conditional. For example, the arrival of a custoim 
is a B-activity since it is unconditional. However, the start service activity is a C-activii 
since it occurs only if the previous event was end-service and the queue size is greater th£ 
zero, or if the previous event was a customer arrival and the queue is empty. 

In this strategy the programmer does not have to explicitly specify reactivations. T1 
system automatically handles them. The generic stmcture of a simulation written in th 
strategy consists of three phases. The A-phase advances the time for simulation, the B -phai 
checks the type of the B-activity that has occurred at that time and executes the procedu 
associated with it, and the C-phase checks the C-activity (or activities), if any, that need 
be executed at that time, and executes the corresponding event-handling procedure. No 
that the A, B and C phases must proceed in sequential order. Each C-activity has a hej 


valuates to true. 

While this strategy relieves the programmer from explicitly programming reactivations, 
le execution is inefficient because all the C-activities need to be checked in the C-phase. If 
lany C-activities are rare-events, then most of the effort spent is wasted. Such inefficiency 
/as absent in event scheduling because of the explicit invocation of C-activities. To make 
le processing efficient and yet retain the simulation strategy, attempts have been made to 
roup C-activities and B-activities that shai'e the same entity. Thus, a set of C-activities is 
rouped with all the B-activities from which they obtain their entities. During the C-phase 
nly those C-activities need to be examined which are grouped with the B-activity that 
xecuted. 

.3 Process interaction 

a the preceding discussion it seems natural to group B and C-activities pertaining to a 
ommon entity, and to order them in their actual sequence of succession. Thus, rather than 
iew the system as a set of event modules it is more natural to view the system as flows of 
ntities. 

The process interaction methodology describes the system’s workings from the view- 
oint of an entity flowing through the system. The model is thus described as a projected 
fe-history of a typical entity, called a process (Evans 1988). To start with, the resources 
nd entities are identified. Then the entity’s resource requirements and interaction with 
ssources, duration of activities etc., are all described in the process. 

Each new instance of an entity is a separate process instance. The execution of a process 
imulation takes a single process instance and executes it until deactivation. That is, if the 
rocess has to wait for a resource, or if the process is to spend a period of time in a given 
ctivity (e.g., service time). At that time another process instance is taken and reactivated 
rom the position where it left off. 

Thus, the process oriented approach is modular. All events and activities comprising a 
irocess are described in the form of the entity model in one place. Note that processes may 
Iso apply to resources. For example, a server process. In the single server queue example, 
tie process oriented simulation can be expressed by means of the entity customer. The 
equence of description is 

generate customer every inter_arrival 
acquire server (waits in queue if necessary) 
get serviced for t time units 
release server 
destroy customer 

In this case events are implicitly handled by the language. The modular approach makes it 
^ery easy to concisely specify the customer’s behaviour. To implement the process oriented 
trategy the language needs to provide support for process suspension and reactivation. For 
xample, when the customer requests for the server and the server is busy then that customer 
nstance needs to be deactivated. When the server is released then the first waiting customer 
n the queue needs to be reactivated. This is usually handled by means of a restricted form 
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of coroutine Q.2i\\Q.6. semi-coroutine (Dahl 1968) or generator (Marlin 1980) construct 
provided in the simulation language. In this form, the choice of coroutine to be resumed 
is determined by a master module that decides on the basis of the next event that is to be 
activated. 

The process oriented approach allows for high level programming but requires that 
the simulation language provide coroutine support, which is typically not the case with 
general purpose languages such as C and C++. (See § 4 for extensions to general purpose 
languages.) Further, by its very nature it can cause the system to deadlock if two process 
instances are waiting for each other to release their resources before acquiring those ol 
the other. This may not be the intended purpose in cases where more than one resource 
is accessed at the same time, but the language does not support atomic access of multiple 
resources. (See Evans 1988 for examples.) However, in most cases this is either a non-issue 
or special constructs are provided in the language. 


3.4 Summary 

As described in this section, there are three prevalent foniis of expressing discrete eveni 
simulation programs: Event scheduling, activity scanning and process interaction. While 
the expressive power is the same in each case, they differ in the level of programming 
Event scheduling is the most efficient mechanism in terms of execution speed but involvej 
explicit specification of events and their actions globally across all parts of the system 
as well as reactivations of conditional events. Activity scanning implicitly handles reac¬ 
tivations by explicitly stating which events are unconditional and which are conditional 
However, it incurs a loss of efficiency due to explicit checks for conditional events at time 
advances. 

Process interaction or process orientation offers a more natural modelling environmen 
to the programmer since the focus is on the entity and its sequence of activities rather thar 
on a global perspective of the entire system. It is efficient at run time but requires support fo; 
coroutines in the specification language. With the emergence of distributed systems somt 
authors have proposed a message based approach to discrete event simulation (Bagrodi. 
et al 1987) where the physical system is modelled as a set of communication processes 
Events are modelled by message communications. An entity is modelled as a messagt 
communicating process. 


4. Discrete event simulation languages and APIs 

Since discrete event simulation is an important field in its own right, a number of language; 
have been designed specifically for the purpose. In other cases general purpose language 
have been enhanced. There is also a common trend to provide application programminj 

infP.T*flPlf*PC 'Fr\r Hi Cr*rPtP pvpnt* CirVinlofl/A-n in a n-ono-rol TMii-n/^oo l^irrVi lovrjal lonmiorf/a Ti 


Historically, discrete event simulation languages were promoted since the 60’s. Simscript 
used the event scheduling strategy, GSP the activity scanning and GPSS the process interac¬ 
tion strategy. Later with the popularity of Algol60, Simula was proposed as a preprocessor 
to perform simulation programming. 

Currently, GPSS is generally regai'ded as the most popular simulation language. It 
is a block-structured language that was conceived to run on mainframes and supported 
by IBM. The GPSS/H processor, a product of Wolverine Software, USA, was first re¬ 
leased in the late 1970s and is available on a variety of hardware platforms ranging from 
PCs to Sun workstations (Scher 1991). The GPSS model is that transactions (i.e., en¬ 
tities) flow through systems to produce dynamic effects. They interact with resources 
by flowing into blocks or program statements. A block remains inactive until a transac¬ 
tion attempts to enter it. Over 40 different blocks describe resource requirements, condi¬ 
tional branching, queueing, data collection, report generation and attribute control (Evans 
1988). 

Simscript started out as a language for event scheduling but the modem version Sim¬ 
script II.5 incorporates a process interaction strategy. Like GPSS, Simscript is commer¬ 
cially available (Markowitz et al 1987). On the other hand Simula, which is more of an 
object oriented language, has received less commercial support as a simulation language. 
SLAM (Pritsker 1986) and SIM AN (Pegden et al 1990) have also been used over the last 
decade. Recently, there is a trend towards object oriented simulation languages as in the 
case of MODSIM III which is commercially available from CACI, and HSL which was 
proposed in Sanderson et a/ (1991). 

In general, event scheduling languages provide features for specifications of events and 
associated actions. The actions will be the code necessary to update the system’s state and 
generation of a random time for a future event. The process-oriented languages, typically, 
include a richer set of language constructs which apart from time delays include interaction 
of processes with resources, such as suspend at a specific queue, resume once the head 
of queue is reached, or suspend until is specific condition is satisfied in the forni of a 
wait-until constmct. The constructs in GPSS, for example, include GENERATE an entity, 
ENTER into a resource, ADVANCE the time spent at a resource, QUEUE at a resource, 
LEAVE the resource and TERMINATE an entity. 

A number of other languages have been proposed in the literature (see Evans 1988 for 
example) and many comparisons have been done (cf. Tocher 1965, Dahl 1968 and Virjo 
1972). Simulation languages commercially available for personal computers have also 
been compared, e.g. the comparison between Simplei and Simian in Houten (1988). 


_J 4.2 Extensions and APIs 

The primary disadvantage of a special purpose simulation language is that it has to be 
learned and compilers for it have to be bought. To overcome this problem a number 
of extensions and application programming interfaces (APIs) for simulation have been 
proposed for general purpose languages. 
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Such an effort started with GASP in the 60s. GASP is a collection of FORTRAN 
subroutines for simulation, and has been very popular among FORTRAN users. Anothei 
popular choice has been to extend Pascal for quasi-parallel programming (Kaubisch et a 
1976; Kriz & Landmayr 1980) or provide APIs in it (Hac 1982; Marsden 1984). Extension: 
to Algol68 have been proposed in Sheam (1975). Use of PL/I for discrete event simulatioi 
has been proposed by Hac (1984), of Ada by Bruno (1984), of Modula-2 in HPSIM (Sharm: 
& Rose 1988) and of SR by Olsson (1990). 

In the late 80’s and in the 90’s, several libraries for discrete event simulation are availabL 
in C and C-(-+. In C-f-1- it is possible to write class libraries that support coroutine 
(cf. Stroustrup & Shapiro 1987) and this allows for process oriented simulation. Proces 
oriented packages in C or C-b-f- include CSIM (Schwetman 1988,1990), SIM+-I- (Lomo’ 
& Baezner 1990) and YacSim from Rice University. Event scheduling packages include 
SMPL (MacDougall 1987), and SimPack (Fishwick 1992). More recent packages in C-F-1 
provide object-oriented simulation features, for example, C-F+SIM (Little &. McCue 1994 
and Awesime (Grunwald 1991). 

Apart from extensions and libraries some packages use only diagrams as a means o 
modelling and simulation. This obviates the need to learn a special purpose language, but i 
can be rather cumbersome if there are a number of diagrammatic blocks as in GPSS. On tb 
other hand specific tools such as Petri-net analysers (cf. Evans 1988) or queueing networ] 
simulators (cf. Melamed & Morris 1985, Funka-Lea etal\99\) ai'e useful within the rang 
of applications they model. To model applications beyond their limits one must opt fo 
a programming language. More recently, Shanbagh & Gopinath (1997) have proposed 
C-F-F simulator generator from graphical specifications. 


5. Future event management 

No matter what be the simulation strategy or the simulation language, in discrete ever 
simulation the underlying mechanism is scheduling of events. Events are generated fc 
future times. At any time the next scheduling instant is the minimum of the future ever 
times. In practice, the number of future events may range from a handful, say in a singl 
server queueing system at light to moderate load, to a huge number of events, say in th 
simulation of a wide area network. 

The following operations are needed on future event list data structure: insert an ever 
(represented by its time and a pointer to associated event information), delete minimw 
time event and return the information pointer, delete or cancel any arbitrary event froi 
the hst. The first two are the most common operations, the last one is used occasionally i 
some simulation applications, e.g., resource preemption. 

The abstract data type appropriate for future event scheduling is the priority queu< 
which can be implemented in many ways. The performance of the implementation ; 
subject to the onerational nrofile of inserts, delete-mins. and cancels. Not siimrisinfrlv tb 
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5.1 Future event list implementations 

The simplest representation of the future event list is an array or linked list ordered by time. 
Though simple to implement it is rather inefficient for large list sizes since search time is 
linear in the size of the list. For this reason researchers have proposed index structures over 
linked lists (Wyman 1976; Franta & Maly 1977, 1978; Hemiksen 1977, 1983; Comfort 
1979; Nevalainen & Teuhola 1979; Davey & Vaucher 1980; Blackstone etn/1981; O’Keefe 
1985), a popular scheme among these being that of Henriksen (1977, 1983), which has 
been incorporated into GPSS (Henriksen & Crain 1982). 

Another common approach is to use a d-heap. The 2-heap was proposed by Williams 
(1964) and then generalised to ^ > 2 by Johnson (1975). In a d-heap the nodes are 
maintained in heap-order, where the value of a node is no less than the value of its parent. 
The d-heap is a complete d-ary tree satisfying heap order. Using breadth first search the 
nodes can be indexed into a single array. Other heap-based implementations include the 
leftist tree (Crane 1972, Knuth 1973b), pagoda (Francon et al 1978), skew heap (Sleator Sc 
Tarjan 1983, 1986), binomial queue (Vuillemin 191%), pairing heap (Fredman et al 1986, 
Stasko & Vitter 1987), Fibonacci heap (Fredman & Tarjan 1987), relaxed heap (Driscoll 
et al 1988) and radix heap (Ahuja et al 1990). 

Search tree-based structures have been popular as well. Binary search trees are the 
natural choice and have been analysed for future event scheduling in Evans (1983) and 
Vaucher & Duval (1975). A variant, called p-tree, to combine the advantage of linear list 
and efficiency of tree structures was proposed in Jonassen & Dahl (1975). Among balanced 
trees or partially balanced trees the most popular version for priority queues is the splay 
tree (Sleator & Tarjan 1985). 

A particular type of implementation called calendar queue was proposed by Brown 
(1988) and also independently proposed by Davidson (1989). In this representation time 
is split into buckets and keys fall within bucket ranges. Indices are wrapped around for the 
‘next year’. A more recent structure called fishspear (Fischer Sc Paterson 1994) has worst 
case performance as the d-heap but is oriented towards better performance in the common 
case and can also be implemented for sequential storage. 

5.2 Analyses and performance comparisons 

Ordinary linked lists sorted by time require 0(n) time for insert and 0(1) time for delete- 
min and delete. On the other hand d-heaps require 0(log n) for insert, delete and delete- 
min (Tarjan 1983). Bollobas Sc Simon (1985) analyse repeated random insertions into a 
heap where each ordering of the inserted elements is equally likely. They obtain that the 
number of exchanges per insertion is bounded by a constant of about 1.76. Fibonacci heaps 
on the other hand have 0(log n) amortised time for delete and delete-min and 0(1) for 
insert (Fredman & Tarjan 1987). Driscoll et al (1988) show that these times hold in the 
worst case for relaxed heaps. 

Various comparisons of priority queue implementations have been reported in the liter¬ 
ature. Several comparisons have been done under the hold model (Vaucher & Duval 1975) 
where a hold operation is one that removes an event from the priority queue and schedules 
a new event after an interval of time d from a specific distribution F. The hold model 
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consists of a sequence of hold operations and is paranieterised by the number of events 
in the event queue and the distribution !F. It has been used in many of the early studies 
including Comfort (1979), Davey & Vaucher (1980), Englebrecht-Wiggans & Maxwell 
(1978), Franta & Maly (1977,1978), Henriksen (1977), Jonassen & Dahl (1975), Vaucher 
& Duval (1975), and Vaucher (1977). 

Jones (1986) compared several implementations under the hold model and showed many 
to outperform heaps. The splay tree was shown to have best performance in his study. In a 
later study Brown (1988) showed that under the hold model the calendar queue performs 
better than the linear linked list and the splay tree. Chung et al (1993) used a Markov hold 
model to evaluate 14 implementations. Also, using a token ring simulation for comparison 
they recommend using the splay tree and the calendar queue while stating that heaps are 
quite ‘stable’ albeit with lower performance. 

McCormack & Sargent (1981) compare several implementations from Comfort (1979), 
Davey & Vaucher (1980), Franta & Maly (1977), Henriksen (1977), Taneri (1976), Ulrich 
(1978), Vaucher & Duval (1975) and Wyman (1976), and show that results from real 
simulation runs are different from that when the hold model is used. They show that 
Henriksen’s method (Henriksen 1977) and the modified heap perfomn well and are less 
sensitive to scheduling distributions. 

Thus, in general, it is not readily apparent which implementation works best for a 
given simulation application. Worst case analyses do not reflect performance accurately 
for the average case. Average case analyses have been done under restrictive assumptions 
or special cases of applications. It will be desirable therefore for simulation packages to 
adopt a variety of future event management mechanisms as in SimPack (Fishwick 1992). 
A knowledgeable user can select the right mechanism but what would be more desirable is 
a high level interface to select the right mechanism for the simulation’s operational profile. 
Typically, in simulations one needs to run several experiments in the debugging stage itself 
and during this phase the various priority queue implementations can be compared. 

6. Front end and back end support 

Though scheduling of future events forms the core of discrete event simulation, there are 
a number of other features that are desirable in a simulation environment. They can be 
classified into front end requirements in the form of diagram editing and graphical output, 
and back end support in the form of random number generation, resource management, 
statistical libraries. This section first specifies desirable features of simulation front ends 
and then desirable features of back ends. 

6.1 Simulation front end 

The simulation front end must in the least capture the system topology and possibly simplify 
the model description in terms of user input, and display graphical output of simulation 
results. 

The most general case of user input should allow for a diagram editor to specify user de¬ 
fined icons to represent processes or resources, connectivity across icons, and connectivity 
constraints if any. Few packages allow this, however. GPSS has a cumbersome diagram 
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notation with over 40 different diagram types representing equivalence to program state¬ 
ments. Typically, no package allows for general purpose model specifications. However, 
specific application domains such as queueing network simulations capture all user spec¬ 
ifications through the front end as in the case of PAW (Melamed & Morris 1985) and its 
successor Q-j- (Funka-Lea etal 1991; also see Shanbagh & Gopinath 1997). 

6.2 Simulation back end 

All simulation environments include support for random number generation, and resource 
management. Some also include statistical libraries. We elaborate on each of these features 
below. Note that our emphasis is only on what is provided in standard packages. There ai'e 
other important aspects such as variance reduction of output and rare-event simulation, 
which are not covered in this paper^. 

6.2a Random number generation: Random number generation forms an integral part 
of a simulation environment. The underlying model of a discrete event system assumes 
that the system remains for a given time in each state. This duration of time is modelled 
using a random number distribution, for example, the inter-arrival time at a queue is often 
modelled as an exponential distribution. Likewise, service time of a customer in a queue, 
number of database items that a query will access can be modelled using random number 
distributions. 

Every operating system is usually equipped with a random number generator that gen¬ 
erates uniform random variates. In simulation we additionally require generation of non- 
unifonn random variates. Generation of random numbers is more difficult than what one 
might expect. As Knuth (1973a) says, “Random numbers should not be generated by a 
method chosen at random.” 

Several packages and studies have used defective random number generators. For in¬ 
stance, the study of Majumdar etal (1988) that simulated performance of parallel processor 
allocation policies used a defective technique for random number generation which led to 
incorrect policy comparisons. This was later corrected in Leutenegger & Vernon (1990). 
The 1988 version of CSIM (Schwetman 1988) used the rand{ ) function which is well 
known to have poor random number generation as given in the UNIX system manual 
page for random( ). A survey of more than 50 computer science text books that contained 
software for random number generation revealed that most of these generators are unsatis¬ 
factory (Park & Meller 1988). This shows the importance of using reliable random number 
generators as given in Kjiuth (1973a), Park & Meller (1988), and L’Ecuyer (1988). 

Any simulation environment must support a variety of distributions, both discrete and 
continuous. There should be support for multiple random number streams. The package 
should allow for transformations on random variables to support practical distributions 
as well as allow for empirical distributions as obtained from measured data. Typical dis¬ 
crete distributions include uniform, Bernoulli, binomial, geometric, Poisson and typical 
continuous distributions include uniform, exponential, Erlang, hyper-exponential, normal 


^The interested reader can find details in the July 1993 issue of ACM Transactions on Modelling and Computer 
Simulation 



numbers for many distributions. 

The general techniques that are used for non-uniform random number generation are 
the inverse method and acceptance-rejection. In the former, a random number is generatec 
by first generating a uniform number between 0 and 1, and then using it as an argumen 
to the inverse of the distribution. In the latter method, the required density is boundee 
by that of a scaled version of another distribution for which it is known how to generate 
random samples. The samples from the known distribution ai’e repeatedly taken until one 
falls under the required distribution. 

6,2b Resource management: Discrete event simulation is widely used to study the be 
haviour of resource contention. For example, contention for CPU and disk in compute 
systems, contention for database in DBMS, contention for machines in job shops, con 
tention for toll booths on highways, etc. Associated with each resource is a resource handle 
and contention queue(s) to store contending entities. The resource handler decides how tc 
schedule entities from the contention queue(s) on to the resource. 

The resource by itself may contain multiple servers, as in the case of a parallel processo 
or a petrol bunk. The entities may all contend in a single contention queue or may b( 
split across several queues each contending for a subset of the servers. Most simulatioi 
environments provide support for single server single queue resources. Some provid< 
support for multiple server single queue resources. 

The resource handling discipline can be preemptive or non-preemptive depending oi 
the application in hand. In computer systems preemption is very common at the CPI 
(but not at disk) whereas in manufacturing systems preemption of executing jobs at plant 
is typically absent. Among non-preemptive disciplines the most common ones are firs 
come first serve (FCFS) and fixed priority. Some systems also provide support for first fi 
and best fit. Among preemptive disciplines the most common one is fixed priority wit! 
preemptive resume. In CPU scheduling round robin is a common preemptive disciplim 
where preemption occurs on every time quantum. 

Having built-in resource scheduling disciplines simplifies the work of the programme 
who is now given access to insertion and deletion of entities in contention queues. If thi 
programmer desires to use a very specific discipline, the interface for using the disciplim 
must be the same as that for ones provided by the simulation environment. The simulatioi 
environment should provide the facility to integrate custom resource schedulers. 


6.2c Statistical libraries: The purpose of discrete event simulation is to study the be 
haviour of a given system. The behaviour of interest to the user is usually captured in th 
form of metrics such as average and variance of response time, throughput and resourc 
utilisation. To correctly estimate these metrics the programmer needs to insert measure 
ment probes at appropriate places in the program. Good simulation environments provid 
support for probes, that is, creation and initialisation, sampling, determining averages 
variance and distributions of accumulated data, as well as confidence intervals. A sophis 
ticated package will provide support for different types of probes, e.g., space average am 
time average. 
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Accumulation of samples can vary according to the estimation method being used. Two 
:ommon techniques for estimating results are regenerative simulation and the method 
3f batch means (Law & Kelton 1993). Regenerative simulation is widely applicable and 
Droduces correct results (Welch 1983). It essentially estimates metrics at system regen¬ 
eration points"*^ and determines confidence intervals (i.e., estimates of the variance of the 
iietric being analysed) across regeneration points. When enough regeneration points have 
3een encountered to meet the desired confidence interval the simulation stops. While this 
nethod produces con'ect results as per renewal theory, it can be rather costly in large sys- 
:ems to generate regeneration points. For this purpose the method of batch means is a more 
Tficient approach, where metrics are estimated at the end of a given batch size of samples 
md confidence intervals are calculated across batches. The batch size must be chosen with 
caution since a small batch size can cause correlation between successive batches. For more 
letails on statistical techniques to estimate steady state behaviour see Pawlikowski (1990). 

7. Summary 

vVe have surveyed the field of discrete event simulation on uniprocessors. We have sum¬ 
marised the formal specifications for discrete event systems. Various strategies for sim- 
alation, that is, event scheduling, activity scanning and process interaction have been 
reviewed. Discrete event simulation languages and extensions and APIs of general pur- 
30se languages for discrete event simulation have been briefly covered. Data structures 
for future event management have been surveyed. These include simple linked lists, heaps 
and variants and a variety of search trees and assorted data structures. Front end and back 
snd support for simulation have been described. Note that topics such as output analysis, 
variance reduction techniques, rare event simulation are specialised topics that deserve a 
separate survey in their own right, and have been treated as outside the scope of this paper. 
Likewise, emerging technologies such as object oriented simulation have not been covered. 

Currently, the trend has been to enable the user to build a simulation model of the system 
under consideration and to efficiently run the simulation code. Not much emphasis has been 
given on separating modelling from simulation as is prevalent in the continuous simulation 
world, e.g., simulation of chemical process plants. It would be desirable to create model 
libraries of resources or of subsystems which can be used for a variety of applications to be 
simulated. Typically, the approach is to rewrite code from simulation to simulation. This 
is not only wasteful in terms of development time but also incurs greater chances of bugs 
in the simulation. 
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Abstract. A bus system whose configuration can be dynamically changed 
is called reconfigurable bus system. In this paper, parallel algorithms for gen¬ 
erating combinations, subsets, and binary trees on linear processor array with 
reconfigurable bus systems (PARES) are presented. 

Keywords. Parallel algorithms; reconfigurable bus systems; combinations; 
subsets; binary trees. 

Introduction 

lerating combinations and permutations are common combinatorial problems. We de- 
e Ctnn to be the number of combinations obtained by choosing m objects out of n 
sets. Sequential algorithms optimally take a time of Oim*Cmn). Akl et al (1989/90) 
e presented a parallel algorithm, using an Exclusive Read, Exclusive Write, Parallel 
idom Access Machine (EREW PRAM) model, which takes 0{Cmn) time optimally, 
tion 3 considers the combinations generation using an m-processor linear PARES. This 
Drithm outputs a combination in constant time. The combinations are generated in lex- 
graphic ascending order. All the Cmn combinations can be generated and output in a 
e of 0(Cmn). Though our algorithm takes the same time as that of Akl et al (1989/90), 
present it because of the implementation feasibility of the PARES model, 
jeneration of subsets has applications in subset sum, knapsack, base enumeration and 
limal covering problems (Stojmenovic & Miyakawa 1983). A cost-optimal systolic 
orithm for generating subsets was presented in Tsay & Lee (1994). In § 4 we present a 
t-optimal parallel algorithm using linear PARES. 

fhe problem of enumerating n-noded binary trees have been studied by introducing 
iger sequences or permutations, which characterize trees in a combinatorial way. One 


sliminary version of this paper has been presented at the 6th National Seminar on Theoretical Computer Science, 


such sequence called P-sequence was introduced by Pallo & Racca (1985). They have 
provided a simple algorithm to generate P-sequences lexicographically. In § 5 an efficient 
and simple parallel algorithm for generating P-sequences is proposed which runs on a 
linear PARES. 

Zerling (1985) reported a new approach for generating rooted ordered binary trees with 
n-nodes in some order by using rotations. He established a 1-1 correspondence between 
binary trees and codewords related to rotations, called rotational admissible codewords 
for binaj-y trees, and presented a recursive algorithm to generate these codewords. Er 
(1989) later presented faster recursive algorithm to generate these codewords. Recently 
Makinen (1991) has given anon-recursive algorithm. All the above algorithms take 0(n) 
time in the worst case to produce the next codeword from the previous one. In § 6 a 
simple parallel algorithm for generating rotational-admissible codewords based on the 
non-recursive algorithm of Makinen (1991) is proposed. This algorithm mns on linear 
PARES. 

In the next section the PARES model is introduced. 


2. The PARES model 

A reconfigurable bus system is a bus system whose configuration can be dynamically 
changed. Processors in PARES are assumed to operate in single instruction multiple data 
(SIMD) mode; that is they perform the same instruction synchronously (at times only very' 
few processors may be active but this will be explicitly specified in the instruction). In a 
single unit of time, each processor can either select a switch configuration, perform a read 
or write operation on the bus, or perform a constant-time computation on its local data. 
The model allows several processors to read the same bus component, but does not allow 
more than one processor to write on the same bus component at the same time. By giving 
specific instructions, the processors reading from a bus can be restricted. For example, only 
when a particular condition is satisfied the processor may read the bus. Each processor may 
have its own built-in ports. These ports can be dynamically connected in pairs based on 
computational needs. When a bus configuration is established, processors that are attached 
to the same bus can communicate with one another with the above restrictions. It is assumed 
that such a communication takes one unit of time. This basic assumption on PARES models 
is acceptable because of the gap between signal propagation velocity and reconfiguring 
rates. With the current hardware technology, signal propagation velocity is very close to its 
limit (velocity of light), whereas the reconfiguration rates are bounded by both theoretical 
and technological restrictions. Thus the time for the switch state to change can be orders 
of magnitude longer than the time for the signal to traverse a distance equal to the diameter 
of a switch. It is also generally assumed that each processor can perform basic arithmetic 
and logical operations in unit time (Schuster 1991; Wang 1991). 

The linear PARES model can be described as follows: It consists of n-processors. Pi, 
P 2 ,Pn- Each processor is connected to its neighbouring processors. The connections 
are made through two built-in ports denoted by N and S (or L and R) of each processor. 
A local switchable network could dynamically change the configuration of the system. It 
is controlled by the processor itself. Figure 1 shows such a system with n =5. 
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Figure 1. A 5-processor linear PARES. Here P/’s - processors and S, N (L, R) are built-in ports. 


3. Generation of combinations in a linear PARES 

Consider a linear PARES with m-processors, each processor having six registers C, L, B, 
R, I and J. In the registers C and L, the components of current (which is used to generate 
the next one) and last combinations can be stored. Register B is used to store a boolean value 
by comparing C and L. Register R contains a copy of B of next processor. Registers I and J 
are used to store the indices of processors. Each combination is generated using its previous 
combination C/, z = 0, 1, 2,..., m — 1, and last combination L,- (Lj ~ n — m -{■ i 
z = 0, 1, 2,..., m — 1, in lexicographic order. Here, assume that we want to generate 
combinations of m integers from the first n integers (1 < m < n). Then it could be easily 
extended to an array of n elements by considering them as indices. The algorithm consists 
of the following steps. 

Algorithm 1. Parallel generation of combinations. 

Step 0: Initially Cj = ./ + 1; Ly = « — m -P y -f 1; /y = J and Jj = 0 are stored on 
corresponding registers of each processor Fy, j = 0,1, 2,..., m — 1. 

Step 7; Set the boolean variable bj with 1, if Cy = Lj, otherwise with 0 (if Cy < Ly) and 
store it in register B of each processor Fy, 0 < y < m — 1. 

Step 2: If bm-\ = 0, then set C,„_i = C„j_i -P 1 and store it in register C of processor 
P,n-i- Go to step 8. 

Step 3: If ^0 = 1 then, go to step 8 (the input combination is the last one i.e., same as Lj). 
Step 4: Each processor Fy, y = 1, 2,..., m - 1, sends a copy of bj to processor Pj-\ \ say 
ry_i, stored in R. 

Step 5: Each processor F/ for which bi = 1 connects port L to port F, for 1 < i < m — 1. 

Step 6: Processor Pj , for which bj -rj = 1, broadcasts Cy and /y to processors Pj^,k = j + 1, 
y + 2,..., m — 1, and received in registers C and J respectively. 

Step 7: Each processor Pk for which rk — I sets Ck = Cjt -P 4 — + 1, and stored in 

register C. 

Step 8\ Each processor F^, /: = 0, 1,..., /n - 1 prints its register C value. Then go to 
step 1. n 

To prove the correctness we proceed as follows: We can make the following observa- 

(•ir»nc rmnn fUA 1 AviArvcTrcmUiA dor-Anrlinfr nrrlAr nf PnmhinJlf'innc Tf C'r. C... i fltld 
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Cj ~ Lj (i.e., Z?i = ^2 = • • • = i>j-\ = 0). 

IfCy+i ^ Lj+] thenCy+i < Ly+i (by observation (ii)). From observation (i), Cy < Cy+i 
and since Lj < we have Cy+i Lj {Lj and Ly+i are consecutive), i.e., Cy+i :< Cj, 
which is a contradiction, therefore 

C'j+l = -LjH-l- 

By induction = L^ for all > y. □ 

If Z?o = 1 then Co = Lq, then C] = Li, Co = L 2 ,, C,n-i — Lm-\ (by lemma 1). 
This implies that the current combination is the same as the last combination (step 3). 
If b,n-\ = 0, there is no Cy = Ly, for j = 0,1,..., m - 1, then the processor P^-i 
increments its component to produce the next combination in order (step 2). By lemma 1 
there exists a unique index j such that the sequence bQ,bi,..., bm-\ will be of the form 
0,0,..., 0,1, 1,..., 1, (first 1 is in processor Py+i). This ensures that there is a unique 
processor Py for which bj • ry = 1, therefore this processor can broadcast Cy and /y in 
step 6. It also proves that step 5 can create a bus starting from processor Py until processor 
Pm-\- To prove the correctness of step 7, we prove the following lemma. 

Lemma 2. Let Co, C\,..., C,n-\ and CIq, Cli,..., Cl^-i be the current input and out¬ 
put combinations of the above algorithm. Then the two combinations are in lexicographic 
ascending order. 

Proof Consider the boolean sequence 

bo, b\,..., btn-i = 0, 0,..., 0, 1,..., 1, 

(first 1 is at processor Py+i). Then from the algorithm we can see that 

Cl( = Ci for z = 0, 1, ,.., y — 1 

Cly = Cy + l>Cy 

Hence the two combinations are in lexicographic ascending order. □ 

It can be easily seen that each step of the algorithm takes only a constant amount of time 
as the broadcast operation takes constant time. Thus we can generate the next combination 
from the previous one in constant time. The algorithm can be made adaptive (i.e., to run 
on an array with k > m processors), if we divide the linear array into k/m groups of m 
processors such that each group can produce an interval of consecutive m-combinations. 
Note that in such cases we need to change the input (current and last) combinations 
correspondingly. 

4. Parallel generation of subsets 

We assume that each processor in the linear array has registers A, B, C, and D. Register 
A is used to store the elements of the current subset. B, C and D are used to store boolean 
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We; shall present a parallel algorithm for generating all subsets in lexicographic 

‘^^roftheset{l, 2 ,...,«}. 

F^arallel generation of subsets. 

^ ^ — O’ for i > 1, and bi = c/ = di = 0, for all i, and store them in 

C and D respectively. 

processor Pi prints ai for which at ^ 0. 

== n then stop. 

^;vith 1, if ai — 0, and 0 otherwise, and q with 1 if at = n, and 0 otherwise. 

processor Pi sends a copy of q to processor P/_i, received in D, for i = 
I^rocessor Pi for which ci • d/ = 1, sets a/ = u,- + 1. 

-<^ssor Pi for which c/ = 1, sets ai — 0. 

'H processor Pi for which di = 0, connects port L with R. Processor P/ for 
= I sends di value which is received in register D of Pi. 

/J> I =: 0, then each processor P,- sends a copy of register B value to Pj+i 
n, received in register D. Then processor P/ for which bi ■ di = I sets 
-T- 1. Then go to step 2. □ 

oil subset either contains only one n or none, there could be only one i for which 
is ensures that there could be almost one processor for which di = 1 in step 7. 
s tine entry prior to n is incremented by 1 (step 5) and the entry corresponding 
:o ?:ero (step 6) to get the next subset. Otherwise (if n is not a member of current 
s p rocessor for which bi • d/ = 1 sets its component of the next subset to a; _ i +1. 
thni generates subsets of the set (1,2, 3,4} in the following order: 

2 

2 2 3 

2 3 2 3 4 

2 3 4 2 4 

2 4 3 

3 3 4 

3 4 4 

4 

2 ;asily seen that the algorithm takes only constant time to generate the next subset 
previous one. 

erating binary trees using P-sequences 

slnall introduce P-sequences. Let rj be the degree of the root of the binary tree 
Pv Tt and Tr, the left and right subtrees of T. Two orders, namely A-order and 


634 


P Thangavel 


DEFINITION 1 

Two trees T and T' are in A-order, denoted by T < T' if (1) |T1 < |T'| or (2) |r| = |T'l 
and Tl < T[ or (3) \T\ = \T'\ and Ti = T[ and Tr < where |71 is the number of 

leaves in T. 

DEFINITION 2 

Two trees T and T' are in B-order, denoted by 7 < T' if (1) rj < rjf or (2) ty = rj> and 
Tl < T[ or (3) iy = rj' and Ti — 7^ and Tr < Tj^. 7-sequences are used to generate 
binary trees in B-order. Now we define 7-sequences as follows; 

DEFINITION 3 

For a given binary tree 7, the 7-sequence of 7 is the integer sequence (pj (1), pr (2),.. 
Pt(\T\ — 1)) where pr (0 is the number of internal nodes O written before the leaf i in 
the Polish notation of 7. 

Theorem 1 (Pallo & Racca 1985), An integer sequence fpi, p 2 ,..., Pn) is the 7- 
sequence of a binary tree with n internal nodes if and only if 

(1) for all z = 1, 2,..., n, p/ < p/+i, 

(2) Pn = n, and 

(3) for all / = 1, 2,..., n, p; > i. 

Theorem 2 (Pallo & Racca 1985). Given two binary trees 7 and T' such that |71 = |7'|, 
7 and T' are in B-order if and only if pr is lexicographically less than pj/. 

The algorithm proposed here is based on the sequential generating algorithm in Pallo & 
Racca (1985). It is worth reproducing the same here. 

Generating algorithm 

Begin with (pi, p 2 ,.p«) = (1,2, ...,n) 
while i = m 2 ix{k\pk < n) exists do 
Pi = Pi+\ 

for j = / -f 1 to zi do pj = max(p/, j) end do 
end do 

We consider a linear PARBS with n processors. Each processor has registers B,C,D and 
I. B and C are used to store boolean values. D is used to store 7-sequence elements and / 
is for index i of the corresponding processor 7/. Initially assume that di = i and f = i for 
i = \,2,..., n. A parallel algorithm for generating 7-sequences in lexicographic order 
is given in algorithm 3. 

Algorithm 3. Parallel generation of 7-sequences. 

Step L Set di = i and /, = i and store them in registers D and I of each processor 7/, 
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Step S.lfbi = 1 then stop. 

Step 4. Each processor P/ sends a copy of h/ to processor P/_i, say c/_i, stored in C, for 
i = 2, 3,..., n. 

Step 5. Each processor P/, for which h/ =1, connects port L to port R. 

Step 6. Processor P/, for which hi - a = I, sets di ~ di^\ and broadcasts di to processors 
P;+i, P;+2> • • •» Pn, Stored in register D. 

Step 7. Each processor P/ for which sets dt with di, if di > Ii, and li otherwise. 

Step 8. Each processor P/ writes di,i = 1,2, ...,n (next P-sequence). Then go to step 2. 

□ 

Note that there is a unique processor P/ for which bi • as the sequence bi, i = 
1,2, .. .,n will be of the form 0,0,...,0,1,1,..,,! which follows from the definition of 
P-sequence. It takes only constant time to produce the next P-sequence from the previous 
one. 


6. Parallel generation of binary trees using rotational admissible codewords 

A rotational-admissible codeword is a sequence of integers obtainable by a rotation-based 
coding method. A codeword (xq,X[,..., x„_i) is rotational admissible if xq = 0 and for 
each / = 1,2,..., n - 1, we have 0 < .x/ < x:, _] -f 1. It has been shown that there is a one 
to one con'espondence between codewords and binary trees (Zerling 1985). In this section 
we shall present a parallel algorithm based on Makinen’s (1991) non-recursive algorithm, 
which is worth reproducing. It uses an array jc containing the codewords to be generated, 
and jc is initialised by zeroes. Basically the algorithm adds one to the previous codeword. 
Variable pos shows the position of x in which we perform the addition, and it is initialized 
to contain the value n — 1. 

Procedure non-recursive Enumerate 
begin 

while x(0) = 0 do begin 
output codeword; 
if X (pos) < x(pos-l) then 
jc(pos) = x(pos) -t- 1 
else begin x(pos) = 0; pos = pos — 1; 
while x(pos) = x(pos-l) -f 1 do 
begin x(pos) = 0; pos = pos-1 end; 
x(pos) = x(pos) -I- 1; pos = n - end 
end 
end; 

Consider a linear PARES with each processor having registers A, B,C and X.X is used 
^ to store the components of the codeword. Register A is also used to store codewords. B 
and C are used to store boolean values. The algorithm is given below. 

Algorithm 4. Parallel generation of rotational-admissible codewords 

Stej? 1. Each processor P/ sets x,- = 0, for 0 < z <n — \. 



Step 2. If XQ = 0, then each processor Pi writes x/, for 0 < / <n — 1. Otherwise stop. 
Step 3. Processor P„_2 sends a copy of Xn-2 to processor P«_i, received in register A. 
Step 4. If x„_] < , processor P„_i sets x„_i = Xn-\ + 1, and then go to step 2. 

Step 5. Each processor P/, 0 < / <n — 2, sends a copy of x/ to processor P/+i, received 
in register A. 

Step 6. Processor P„_i sets x„_i = 0. 

Step 7. Each processor Pi sets at = a/ + 1, stores in register A itself. 

Step 8. Each processor P,- sets bi with 1, if ai = x/ otherwise 0. 

Step 9. Each processor P/ connects port L to port R for which P/ = 1. Processor P„_i 
sends a 1 signal through this bus, which is received in register C of each processor in the 
bus. Disconnect all switches. 

Step 10. Each processor P, for which b, • c/ = 1 sets x/ = 0 and bt • c, = 1 sets x/ = x; +1. 
Then go to step 2. □ 

It is worth noting that there is a unique processor P/ for which bi ■ ci = 1. The above 
algorithm is repeated until xq = 0. It can be easily seen that the algorithm takes only 
constant amount of time to produce the next codeword from the previous one. 

7. Conclusion 

We have presented parallel algorithms for generating combinatorial objects combinations, 
subsets, and binary trees based on P-sequences and rotational admissible codewords. Our 
algorithms take only constant time to produce next object from the previous one. There 
are many combinatorial open problems for which we can design fast parallel algorithms 
by exploiting the flexible nature of PARES models. 
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Abstract. The problem investigated is of an elastic-perfectly plastic infinite 
plate containing two equal collinear and symmetrically situated sfi-aight cracks. 
The plate is subjected to loads at infinity inducing mode I type deformations at 
the rims of the cracks. Consequently, plastic zones are formed ahead of the tips 
of the cracks. The loads at infinity are increased to a limit such that the plastic 
zones formed at the neighbouring interior tips of the cracks get coalesced. The 
plastic zones developed at the tips of the cracks are closed by applying normal 
cohesive quadratically varying stress distribution over their rims. The opening 
of the cracks is consequently arrested. Complex variable technique is used in 
conjugation with Dugdale’s hypothesis to obtain analytical solutions. Closed 
form analytical expressions are derived for calculating plastic zone size and 
crack opening displacement. An illustrative numerical example is discussed to 
study the qualitative behaviour of the loads required to arrest the cracks from 
> * opening with respect to parameters viz. crack length, plastic zone length and 

inter-crack distance. Crack opening displacement at the tip of the crack is also 
studied against these parameters. 

Keywords. Plastic zone; mode I type deformation; opening mode deforma¬ 
tion; rims of cracks; crack-opening displacement. 

1. Introduction 

Work on crack problems was first started by Inglis (1913). The complex variable for¬ 
mulation was given by Kolosov (1935). This was further developed by Muskhelishvili 
^ (1953) and was successfully applied by him to solve the crack problems for cracks of 

different geometry. The treatment of crack problems for fracture of brittle materials was 
given by Cherepanov (1974) using complex variable technique. Dugdale (1960) proposed 
an elastic-plastic model, called “strip yield model” for a sheet containing a slit. The sheet 
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opening was arrested by applying closing normal yield point stress to the rims of plastic 
zones. Smith (1974) gave a general theory based on the cohesive zone model for structures 
in the vicinity of a crack tip. The model was modified by Harrop (1978) for cases when plas¬ 
tic zones were closed by a cohesive normal parabolic stress distribution. The case of two 
interacting equal collinear straight cracks in an isotropic elastic unbounded plate was dis¬ 
cussed by Viola (1983). Theocaiis (1989) extended to “strip yield model” to the case when 
an infinitely elastic, perfectly plastic plate is weakened by two collinear straight cracks. 

The present paper investigates the case of two equal, collinear, symmetrically-situated 
straight cracks weakening an infinite plate. The plate is subjected to uniform constant 
tension normal to the rims of the cracks, at the infinite boundary of the plate. Consequently, 
the cracks open forming plastic zones ahead of the tips of the cracks. The tension applied 
is increased to a limit such that the plastic zones at the two neighbouring interior tips of 
the cracks coalesce. The rims of plastic zones formed (at the interior and exterior tips of 
the cracks) are then subjected to normal cohesive stress distribution t^Gye, where t is any 
point on any of the plastic zones and Oye is yield point stress. Thus, opening of the cracks 
is arrested. 







2. Basic formulae 


Stress components Pxx^ Pyy and Pxy and displacement components Ux and Uy may be 
expressed in terms of two complex potentials (p (z) and Q (z), developed by Muskiielishvili 
(1953), as 

Pyy - i Pxy = 0 (z) + ^ (z) - (Z - z)4>'{z), (1) 

2m(m;c,.x + iuy,x)=K(p(z) -Q{z)-(z- z)(p'{z). (2) 

A bar over a function denotes its complex conjugate. A dash after a function denotes 
differentiation with respect to the argument, while a comma after a function signifies 
partial differentiation with respect to the subscript following it. The elastic constant /x 
denotes shear modulus and k = (3— 4v) for the plane strain case and k — (1 — v)/(l -f- v) 
for the plane stress case, v being Poisson’s ratio. 

Consider a homogeneous, isotropic elastic infinite plate in the xy plane containing 
n straight cracks L/ {i = 1, 2,...«) lying on the real x-axis. Dual problems of linear 
relationship are obtained using (1), when the rims of the cracks are acted upon by stresses 
P^y and P^y . The two Hilbert problems so obtained may be written as 

<^+(r) + £2-(/) = P+onL (3) 

+ R+CO = P~y - iP~y, (4) 

where L = under the assumption lim^_»o{y0^(^ + iy)} = 0. 

Superscripts -T and — indicate the limiting value of the function when any point t on 
the crack, other than end points, is approached from the positive y-plane (y > 0) and the 
negative y-plane (y < 0), respectively. 

The solutions of (3) and (4) for complex potentials 0 (z) and (z) may be written, as 
0(z)=0o(z) + (Pn(z)/X(z)), (5) 

^(z) = ^2o(z) + (Pn(z)/X(z)), '(6) 
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where 


Mz) 

^o(2) 


q(t)Xit)^ 1 

dr — 

z 2 


f p{t)X{t) f q(t)X^. 

/ 

JL t — Z Jl t - 

r p(t)X(t) ^^ _ r 
Jl t - z JL t - 


p(t)X(t) 
t - z 

(Too being the tension applied at infinite boundary, 

Pit) = + Py-y) " ^[P/v + P^yl 

1 , . 

JCVJ’ 


q(OX(t)^ , 1 

—dt + -o-oo, 
2 2 


tlit) = ^)Pyi-P,7y]-~lPj-y 


and 


■^(z) = n - zii)(z - , 

i = l 

ai and hi are the end points of the crack L/ and 


P„(z) = Coz'' +C,z'‘~' + 


+ Cn- 


(7) 

(8) 

(9) 

( 10 ) 


( 11 ) 

( 12 ) 


The constants Ci {i = 1,2, .. .n) are determined by the condition of single-valuedness 
of displacement at the rims of the crack and Cq is determined from the boundary condition 
at infinity. Stress intensity factor, Kj (t= K\ - iKi), at the crack tip ^ may be 
calculated, as given by Cherepanov (1974), from 

K] = K] — iK 2 = lim {(z - zi)^^^0(z)}, (13) 

Z~^Zl 

where 0(z) is obtained from (5). 


3. Formulation of the problem 

A homogeneous, isotropic, elastic-perfectly plastic infinite plate, in the xy-plane, is weak¬ 
ened by two equal, collinear and symmetrically situated straight cracks Li and L 2 . These 
cracks Li and L 2 occupy tlie ligaments [—h, —a] and [a, b] on the real jr-axis as depicted 
in figure 1. The plate is subjected to uniform uniaxial tension, (Too, parallel to the y-axis, 
at infinite boundary. Consequently, the cracks Li and L 2 open forming the plastic zones at 
the tips —h, —a, a and b. The prescribed tension at infinity is increased to the limit where 
the plastic zones developed at the tips —a and a get coalesced. Thus the entire ligament 
[—a, a] forms the interior plastic zone, Fi. The plastic zone r 2 ahead of the crack tip ~b 
occupies the ligament [—c, —b] and the zone Fs at the tip b occupies the ligament [b, cj. 
Each of the plastic zones Ff (/ = 1,2, 3) is closed by cohesive normal stress distribution 
t^Gye, consequently arresting the opening of the cracks. Yield point stress is denoted by 
aye and t is any point on any of the plastic zones. 


4. Solution of the problem 
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These two problems are appropriately derived from the problem stated in § 3. These are 
termed as problem I and problem 11. 

4.1 Problem I 

The configuration of the problem I is: an infinite, homogeneous, isotropic and elastic- 
perfectly plastic infinite plate, in the :ry-plane, contains a stress-free straight crack R\ 
(= r2 U LiUT \U L 2 UT 2 >) occupying ligament [—c, c] ofthe real x-axis. Uniform tension 
OoQ, parallel to y-direction, is applied at infinite boundary of the plate. The complex 
potentials 0] (z), (z) for this case may directly be written using Muskhelishvili (1953), 

as 


Ait ( ■ 7 \ — n -7 f •R' _1/2 


A\ 
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Figure 2. Variation of normalised load ratio versus crack length. 


Opening mode stress intensity factor, /<C[, for this problem, at the crack tip z = c is 
calculated by substituting 0i(z) for <p{z) from (14) into (13) and is given by Cherepanov 
(1974). 

k( = aoo(7tc)^^^ (15) 

4.2 Problem II 

A homogeneous, isotropic and elastic-perfectly plastic infinite plate in the jry-plane con¬ 
tains two equal collinear symmetrically situated straight cracks L i: {—b, —a] and Z- 2 : [a, b] 
lying on the x-axis (refer figure 1). The cracks Li and L 2 possess the plastic zones Fi, r 2 , 
r 3 ahead of the tips of the cracks. These plastic zones Fi, r 2 , r 3 occupy the region [—a, a], 
[~c, —b] and [b, c] respectively. The boundary conditions of the problem are the following. 

(i) The unbounded plate is stress-free at infinite boundary. 

(ii) The rims of the plastic zones F/, {i = 1, 2, 3) are subjected to tensile stress = 
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r} 0.20 
(U 


plastic zone ratio j(c-b)/2a| 

AAAAA 0.1 ' 

Q Q Q Q .P 0.2 
0.3 

**_*_*_* 0.4 


half inter-crack distance 

Figure 3. Normalised load ratio versus inter-crack distance. Crack length is one unit. 


Dual Hilbert problems are obtained in terms of potentials (p 2 (z) and ^ 2 (z) using (1) and 
boundary conditions (ii) and (iii) as 

[4^(0 + fi2Wl+ + IMO + ^2(‘)r = l^CTye, (16) 

[fc(0 - Q2(0r - - ^22(01“ = 0, onr = !7,i,rj. (17) 

The subscript 2 indicates that the functions refer to the problem II. 

Solutions of (16) and (17), are written using (5) to (12) and boundary conditions (i) and 
(iv), as 

'h(.l) = -Z^\(^-zAAi-izX(z)A^ (18) 

niX(z) |_\ 3 y 

= ^ 2 (z), 

= (zt/2) + sin"^(n/c) — sin~^(h/c), 

_i zic'^ - 7T 

A 2 = tan -—^5 -— tan —-, 

a{z^ — £:^)V2 ^(^2 _ ^2)1/2 2 


where, 


X(Z) = (C^-Z^)V2. 
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crack length 

Figure 4. Variation of normalised crack opening displacement versus crack length at the 
interior tip of the crack. 

Opening mode stress intensity factor for problem II, /ifp, is obtained substituting the 
lue of 02(z) for 0(z) from (18) into (13) and simplifying, one gets 

Plastic zone size and crack opening displacement 

astic zone size for original problem (stated in § 3) is obtained by superposing two stress 
tensity factors obtained for problem I and problem II. Using the fact that the stresses 
main finite at every point of the plate leads to the nonlinear equation. 

^[1 - (2/7r){sin ^(b/c) - sin ^(afc)]] — (Joo((^ye — 0, (21) 

r determining c when values of Ooo/Oye, a and b are prescribed. Finally the plastic zone 
nigth is calculated by |c — ^|. 




Figure 5. Variation of normalised crack opening displacement versus crack length at the 
exterior tip of the crack. 


where E is Young’s modulus, Im[ ] denotes the imaginary part of the quantity in the 
bracket. The complex potential OCz) is obtained by superposing the non-singular terms 
of 01 (z) {from (14)} and 02(^) (from (18)}. Substituting the value of <I>(z) thus obtained 
in (22) and integrating, one obtains the crack-opening displacement (COD) at the exterior 
tipz = b as 

Uy = —{2(7yecb/7i:E)Ai{b{l + c sirT^ {b/c)}, (23) 

and at the interior tip z = a the COD is given by 

Uy = —{IcTyeCa/TVE)Ai{a{l — A csm~^(a/c)}. (24) 

6. Illustrative example 

A qualitative study has been carried out to find the load ratio required to prevent the cracks 
formed from opening, as the crack length and inter-crack distance (the distance between 
the two neighbouring tips of the two cracks) is increased. 

Normalised load ratio has been plotted against increasing crack length in figure 2. 
Normalising factor of stress intensity factor (SIF) is 10. It is observed that for longer 
cracks greater load is required as expected to close the crack. Also for a fixed crack length. 
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Figure 6. Normalised crack opening displacement at interior tip with respect to inter-crack 
distance. The crack length is taken as one unit. 


if plastic zone ratio (the ratio of outer plastic zones/inner plastic zone) is increased then a 
higher load still is required to close the crack. The two cracks were kept apart at a fixed 
distance. 

Figure 3 depicts the variation of load ratio as inter-crack distance is increased. The 
crack length is unity. It may be noted that, for a fixed plastic zone ratio, as the cracks move 
apart greater load is required to close them. This is because the effect of the cracks on one 
another diminishes. 

The variation of crack-opening displacement at the interior tip, z = a of the crack L 2 , 
versus increase in crack length is depicted in figure 4. The normalising factor of COD 
is (o-ye/lOE). The same variation is plotted at the exterior tip z = of the crack L 2 in 
figure 5. It may be noted as crack length increases the cracks open more at both the tips, 
for a fixed plastic zone ratio. But it is interesting to note the difference in the variation 
patterns of COD at the inner and outer tips of the crack. It may be noted that at the two 
interior tips of the COD, the pattern is such that the plastic zones get coalesced. 





Figure 7, Normalised crack opening displacement at exterior tip with respect to inter-crack 
distance. The crack length is one unit. 


7. Conclusion 

The Dugdale model (1960) was modified by Harrop (1978) for a single crack when a 
parabolic load is distributed over its plastic zones. The present paper modifies the extended 
Dugdale model given by Theocaris (1989) for two straight cracks in an elastic-perfectly 
plastic infinite plate. Here, the quadratically varying cohesive stress distribution closes the 
rims of the plastic zone. 

The results obtained may be applied to a plate-type structure in which two similar straight 
cracks develop in the vicinity of each other and their propagation under pressure causes 
coalescence in the plastic zone, viz. aeroplane wings, turbine blades, stretched sheets etc. 

The dimensions of the plate should be about five times larger than the total length of the 
two cracks and the inter-crack distance should be such that the stresses at the crack tips do 
not affect the stresses at the boundary of the plate. 


The authors are grateful to Prof R D Bhargava, formerly of the Indian Institute of Tech¬ 
nology, Bombay for his invaluable suggestions and encouragement during the preparation 
of this paper. We thank the referees for their valuable suggestions which have improved 
the readability of the paper. 
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List of symbols 


a, b, c 

E 

Ki 

Lj (j = 1, 2) 

Pij {i, j = x, y) 
Pn{z),X(z) 

Hi (i ~x,y) 
z = X 4- iy 
T; (j = l,2,3) 
P 

V 

^ye 

<Xoo 


tips of the cracks; 

Young’s modulus; 

stress intensity factor for mode I type deformations; 
cracks; 

stress components; 
complex functions; 
displacement components; 
complex variable; 
plastic zones; 
shear modulus; 

Poisson’s ratio; 
yield point stress; 

tension applied at infinite boundary; 
complex potentials. 
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Analysis of deformed microstrip resonator using the finite 
element method 

A S CHAUDHARI and P B PATE.* 
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Abstract. Microstrip resonator with shape deformation has been analysed us¬ 
ing the finite element method (FEM). Keeping the front surface of the micro strip 
resonator fixed, the back surface is deformed. Five cases of the deformation of 
the back surface are considered: (i) the left vertical side of the back surface is 
shifted inward; (ii) both vertical sides of the back surface are shifted inward; 
(iii) the right vertical side of the back surface is shifted outward; (iv) both ver¬ 
tical sides of the back surface are shifted outward; (v) left vertical side of back 
surface is shifted inward and the right vertical side of it is shifted outward. 
Variation in cutoff frequency for the TMon, TMno, TMm, TM012, TMn2 
and TM210 is observed. 

Keywords. Microstrip resonator; finite element method. 

1. Introduction 

Finite element method (FEM) is a powerful tool for the approximate solutions of differential 
equations governing diverse physical problems. It can tackle problems of different branches 
in physics and it can be used for any complicated geometry or domain of regular or irregular 
shape. The method can deal with the problems described by coupled equation. The method 
is useful for inhomogeneous and anisotropic media also. 

The resonant frequencies of open microstrip ring resonators are determined by Wolff 
& Menzel (1975) and Pintzos & Pregla (1978). The microstrip resonator with equilateral 
triangular patch is studied by Wolf & Knoppic (1974), Helszajn et al (1979), Lyon & 
Helszajn (1982). The triangular and rectangular patch microstrip resonator has been anal¬ 
ysed by Kalamse & Patil (1994) using the finite element method. 

In this paper we have analysed the microstrip resonator with shape deformation for five 
cases: 

(i) the left vertical side of the back surface is shifted inward; 

(ii) both vertical sides of the back surface are shifted inward; 
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(iii) the right vertical side of the back surface is shifted outward; 

(iv) both vertical sides of the back surface are shifted outward; 

(v) left vertical side of back surface is shifted inward and the right vertical side of it is 
shifted outward. 


2. Statement of the problem 


Consider the rectangular microstrip resonator bounded by six faces Bl, B2, B3, B4, B5, 
B6, as shown in figure la. 

The two side surfaces Bi, B 2 and front and back surfaces B3, B4 are magnetic walls. 
The top and the bottom surfaces are electric walls. 

The electric field E within the resonator will satisfy Maxwell’s equations, 

Curl Curl E -= 0, (1) 

where 

grad div E - V^E - K^E = 0. (2) 

Since the medium is charge free, div E = 0. 

V^E + K^E = 0. (3) 

The eiectiic field within the substrate has only the Z component and the magnetic field 
has the X and Y components. The tangential component of the magnetic field at the edge 
is negligible. 



Figure 1. (a) Microstrip resonator without deformation, (b) Left vertical side of the back 

surface is shifted inward, (c) Both vertical sides of the back surface are shifted inward, (d) 
Right vertical side of the back surface is shifted outward, (e) Both vertical sides of the back 
surface are shifted outward, (f) Left vertical side of the back surface is shifted inward and 
right vertical side of the back surface is shifted outward. 
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div grad Er + E^ ~ 0. 


(4) 


The fields within the resonator corresponding to TM modes will be generated by the 
nation, 

+ K'^E^ = 0, (5) 


>jected to the boundary condition 
dE, 


dn 


Bi,62,63^4 


= 0 , 


£^zIB5,B6 = 0’ 

lere Ez is the z component of E, and d/dn represents nonnal derivative. 


( 6 ) 

(7) 


Variational formulation 

get the expression for the functional n in variational formulation, multiply (4) by some 
light function V and integrate it over the domain of the resonator, 

n= f f f Vdivgrad£2d^^ +/* f f VEzdQ. (8) 

J jQ J J JnJ 

ing the vector identity 5 div A = divC^A) — (grad S) • A for the first term, (8) becomes 


U= f f f div (V grad dr2-- f f /(grad V) (grad Ej-) d£^ 

j JqJ J JqJ 

+ K^ f f f VEzdQ. 

J Jo, J 

pplying Gauss Divergence theorem to the first term in (9), we get 

n = - f f f (grad V) ■ (grad Ez)da + K'^ f f f V Ez d^ 

J J^J J JqJ 

J J V grad Ez • rids. 

sing (6), the last term in (10) will vanish. 


(9) 


( 10 ) 


ble 1. Variation of cutoff frequency for different modes for the inner deformation of one vertical side. 


ift 

cm) 



Frequency (GHz) 



TMoii 

TMiio 

TMiii 

TM 012 

TM 112 

TM 210 

5 

13.607 


15.005 


17.641 

17.906 

0 

13.606 

14.046 

15.035 

16.470 

17.663 

17.992 

5 

13.605 

14.077 

15.066 

16.470 

17.685 

18.073 

0 

13.604 

14.107 

15.100 

16.470 

17.707 

18.148 

5 

13.603 

14.139 

15.136 

16.470 

17.728 

18.219 

0 

13.601 

14.170 

15.175 

16.470 

17.749 

18.287 

5 

13.599 

14.202 

15.217 

16.471 

17.770 

18.351 

to 

13.597 

14.233 

15.261 

16.473 

17.792 

18.413 








Table 2. Variation of cutoff frequency for different modes for the inner deformation of both vertical 


sides. 

Shift 



Frequency (GHz) 



L (cm) 

TMoii 

TMiio 

TMni 

TM 012 

TM 112 

TM 210 

0.05 

13.607 

14.045 

15.035 

16.469 

17.667 

17.990 

O.IO 

13.607 

14.103 

15.104 

16.468 

17.719 

18.130 

0.15 

13.607 

14.161 

15.184 

16.466 

17.778 

18.269 

0.20 

13.608 

14.218 

15.277 

16.464 

17.842 

18.379 

0.25 

13.608 

14.272 

15.386 

16.461 

17.915 

18.477 

0.30 

13.609 

14.324 

15.513 

16.458 

17.998 

18.567 

0.35 

13.611 

14.372 

15.659 

16.455 

18.093 

18.650 

0.40 

13.612 

14.417 

15.825 

16.451 

18.203 

18.728 


By substituting V = £* and changing the sign, (10) becomes 

n = i / / / (grad E,) . (grad £») j j E,E* dS2. 

^ is introduced since FI is bilinear functional. 

The first variation SH is given by, 

50 = 5! j j^l(VE.)-iVE*)dn-K^- j j I E,E*dQ . 

For n to be stationary, <511 should be minimum. 


( 11 ) 


( 12 ) 


4. Discretization 

According to the finite element method (Reddy 1986; Akin 1988), the volume of the 
resonator is divided into hexahedral elements with 20 nodes. The mapping functions as¬ 
sumed for these elements are quadratic in nature. The functional over an element is given 
by 

n" = - ^-K^J2[ElflT‘]{El], (13) 

ele ele 


Table 3. Variation of cutoff frequency for different modes for the outer deformation of one vertical side. 


Shift Frequency (GHz) 


L (cm) 

TMoii 

TMiio 

TMiu 

TM 012 

TM 112 

TM 210 

0.05 

13.607 

13.957 

14.950 

16.470 

17.595 

17.719 

0.10 

13.606 

13.930 

14.925 

16.469 

17.570 

17.622 

0.15 

13.604 

13.903 

14.902 

16.469 

17.551 

17.515 

0.20 

13.602 

13.877 

14.879 

16.469 

17.522 

17.416 

0.25 

13.599 

13.853 

14.857 

16.468 

17.495 

17.315 

0.30 

13.595 

13.831 

14.836 

16.467 

17.468 

17.214 

0.35 

13.590 

13.810 

14.816 

16.465 

17.438 

17.116 

0.40 

13.583 

13.792 

14.797 

16.463 

17.410 

17.019 
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ble 4. Variation of cutoff frequency for different modes for the outer deformation of both vertical 
les. 

ift Frequency (GHz) 

(cm) TMoii TMiio TMm TM 012 TM 112 TM 210 


)5 

13.607 

13.928 

14.926 

16.469 

17.574 

17.620 

10 

13.607 

13.872 

14.882 

16.468 

17.533 

17.416 

15 

13.607 

13.817 

14.843 

16.465 

17.494 

17.211 

>0 

13.607 

13.765 

14.808 

16.461 

17.457 

17.014 

>5 

13.607 

13.714 

14.777 

16.453 

17.422 

16.826 

50 

13.606 

13.666 

14.749 

16.437 

17.388 

16.656 

55 

13.606 

13.621 

14.723 

16.388 

17.355 

16.528 

10 

13.606 

13.577 

14.700 

16.269 

17.323 

16.479 

here 

5,7 = / 

rdFi dFj dFi 
_ dx dx dy 

dy dz dz . 

dx dy dz 


(14) 

id 

TlJ=f 

FiFj dx dy dz. 




(15) 


Here F/ is the mapping function due to the ith node and integrations are over the mesh 
ement surface. 

The functional for the whole region is given by 


n = - 'jK^{E,f[T][E,}. 


(16) 


le condition that variation of FT must be minimum i.e. zero, gives 

- K'^[T]{E,} = 0. (17) 

quation (17) is the matrix equation to be solved to get eigenvalues and eigenvectors. 


Numerical calculations for microstrip resonator 


microstrip resonator of sides 2.4 cm x 1.2 cm x 2.8 cm is considered. The deformation 
ir each case is shown in figure l(b-f). 


ible 5. Variation of cutoff frequency for different modes for the inner deformation of one vertical side 
id outer deformation of other vertical side. _ 


lift Frequency (GHz) 


(cm) 

TMoii 

0 

TMm 

TMo12 

TMn2 

TM 210 



13.987 

14.976 


17.615 

17.816 



13.991 

14.973 

1.6.471 

17.605 

17.820 



13.996 

14.969 

16.472 

17.588 

17.827 



14.004 

14.964 

16.474 

17,565 

17.836 



14.013 

14.956 

16.476 

17.537 

17.848 



14.025 

14.948 

16.479 

17.505 

17.862 



14.037 

14.938 

16.482 

17.469 

17.879 














Figure 2. Variation of cutoff frequencies 
for different modes for the inner deforma¬ 
tion of one vertical side of back surface. 



Deformation in cm 



0.15 0.2 0.25 0.3 0.35 0.4 

Deformation in cm 


Figure 3. Variation of cutoff frequencies 
for different modes for the inner deforma¬ 
tion of both vertical sides of back surface. 
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Figure 4. Variation of cutoff frequencies 
for different modes for the outer defonna- 
tion of one vertical side of back surface. 
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Figure 6. Variation of cutoff frequencies 
for different modes for the inner deforma¬ 
tion of one vertical side and outer deforma¬ 
tion of other vertical side of back surface. 


(i) First case: Left vertical side of the back surface of the microstrip resonator is shifted 
inward by 0.05 cm. 

(ii) Second case: Both vertical sides of the back surface of the microstrip resonator are 
shifted inward by 0.05 cm. 

(iii) Third case: Right vertical side of the back surface of the microstrip resonator is shifted 
outward by 0.05 cm. 

(iv) Fourth case: Both vertical sides of the back surface of the microstrip resonator are 
shifted outward by 0.05 cm. 

(v) Fifth case: Left vertical side of the back surface is shifted inward by 0.05 cm and the 
right vertical side is shifted outward by 0.05 cm. 

In each case, for every shift, eigenvalues and eigenvectors are obtained. TMqi i, TMi lo, 
TMiii,TMoi2,TMi 12 and TM 210 modes are identified using field plots. These eigenvalues 
are the values of ay^/iQSQ. From this, the corresponding cutoff frequencies are calculated 
in gigahertz. 

For every shift, in each case, variation of cutoff frequency with the shift for the above 
modes is given in tables 1-5 and shown graphically in figures 2-6. 


6. Conclusions 

For the inner deformation of one or both vertical sides of the back surface, the cutoff 
frequencies increase with the shift for all the modes, except for TMqi 1 and TM012 where 
they are almost constant. 
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?or the outer deformation of one or both vertical sides of the back surface, the cutoff 
quencies decrease with the shift for all the modes. 

Por the inward deformation of one vertical side and outward deformation of the other 
tical side, values of cutoff frequency for the TMqi i, TM] n and TMi 12 modes decrease 
h the shift, while for the TMno, TM 012 and TM 210 modes they increase. 
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effect of surface stresses on surface waves 
n elastic solids 
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Abstract. This paper deals with the propagation of surface waves in homo¬ 
geneous, elastic solid media whose free surfaces or interfaces of separation are 
capable of supporting their own stress fields. The general theory for the prop¬ 
agation of surface waves in a medium which supports surface stresses is first 
deduced, and then this theory is employed to investigate the particular cases of 
surface waves, viz. (a) Rayleigh waves, (b) Love waves and (c) Stoneley waves. 
It is seen that the Rayleigh waves become dispersive in nature; and, in case of 
low frequency with residual surface tension, a critical wavelength exists, below 
which the propagation of Rayleigh waves is not possible. This critical wave 
length is directly proportional to the surface tension. Some numerical calcu¬ 
lations have been made in the case of Love waves and conclusions have been 
drawn. 

Keywords. Surface waves; effect of surface stresses; Rayleigh waves; Love 
waves; Stoneley waves; residual stress. 

. Introduction 

lurface waves play an important role in the science of earthquakes. To match the ac- 
aal situation in a given problem, three types of surface waves are generally introduced 
/hich are classified according to their nature of displacements (Dey & Sengupta 1978; 
lhandrasekharaiah 1986; Das etal 1992; Pal etal 1996). Again it is known that the phys- 
:al properties of bodies in the neighbourhood of the surface are sensibly different from 
tiose in the interior. Thus the boundary surface may be regarded as a two-dimensional 


propagation. The Rayleigh wave velocity equation has been obtained as a particular case. 
Following Chandrasekharaiah (1987), the effect of surface stresses on Rayleigh wave prop¬ 
agation has been discussed. Next we discuss Love wave propagation with surface stress 
on its free boundry, with some numerical computations and conclusions. Wave velocity 
equation for Stoneley waves has also been deduced. 


2. Basic equations 


Let us consider two homogeneous, isotropic, elastic solid media M\ (lower medium) 
and M 2 (upper medium) with different material constants. We also consider that a two- 
dimensional elastic layer is present at the surface of separation of the two media. We take 
an orthogonal cartesian frame of axes Ox\X 2 X 2 , O is taken on the interface and 0 x 3 is 
positive in the downward direction. As in Bullen & Bolt (1985), here also the wave travels 
in the Ox\ direction. The disturbances of the particles on lines parallel to 0x2 vibrate in 
phase, which means all partial derivatives with respect to X 2 are zero. 

The equation of motion in the absence of body forces is 


/X V^u -h (A. -t- jLt) grad div u = 



( 1 ) 


In the above Navier’s equation of classical elasticity u (u\, U 2 , W 3 ) is the displacement 
vector; A., /x are Lame elastic constants; p is mass density and t represents time. To in¬ 
vestigate plane deformation parallel to the X 1 X 3 plane we introduce the displacement 
potentials P, Q related to the displacement components wi (xj, X 3 , t) and M 3 (xi , X 3 , t) by 
the equations. 


dP dQ 

= 7-7—, «3 

0 X 1 0 x 3 


Introducing (2) in (1) we obtain, 

i2 


r.2 1 9^ 

- 


v2-iL 

dt^ 

V. . 

V72 1 92 '' 

- 


F=0, 

2=0, 

M2 =0, 


dP ^ dQ 
9x3 


( 2 ) 

( 3 ) 

( 4 ) 

( 5 ) 


where a = (A -f IjiJ and b = (/x/p)^/2 gj-g velocities of dilatation and distortion 
respectively. Befitting the actual situation of the problem, we seek solutions to (3)-(5) 
as 


[P, Q, «2] = [Hxz), Q(X3), ii2fe)] 


( 6 ) 
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Inserting (6) in (3)-(5), we get 


- V' 


d^P 

2 

d^M 2 2 

dx^ ^ 


1 - 


Tj^a^ 


ifiyi 


r 




P = 0 , 


( 2 = 0 , 


M2 =0. 


(7) 


Now Pix 2 ,), Qixf), ^ 2 (^ 3 ) describe surface waves and as such they must be vanishingly 
small as JC 3 —^ 00 . Hence the solution of (5) for the medium M\ may be taken as 

P = A\GXp(-rimiX3)t^^^^^ 

Q — Al exp(-r?m22C3) 

U'l — A 3 exp(-77m2X3) e' ^0 ^ 


( 8 ) 


where 


m 


1 = (1 - m 2 = (1 - 


c = 




c C 

r = -, s = 
a o 


(9) 


wavelength = litlr], wave velocity = ^ being aknown constant while 77 is an unknown 

constant; and, A\, A 2 , A 3 are arbitrary constants. 

Analogous solutions with primes (for the region 0 < X 3 < 00 ) 

p' = A\ &x-p{r}m\x3) 

Q'= A2exp(77m2X3) 

U 2 = A 3 exp(77m2X3) 

are obtained for the upper medium M 2 of the material, where 
' 2 ) 1 / 2 ^ 


( 10 ) 


c = -, 

77 


r' = 


a 


7 ’ 




c 

T' 


( 11 ) 


3 . Boundary conditions and solutions 

We assume that the plane X 3 = 0 is a material layer which 

to the lower medium and is a two-dimensional elastic 1 ,^ 

of supporting its own stress represented by surface stress tensor Em’ ^ 

(Chandrasekharaiah 1987), _ i 9 

\Sia[<y + (Xo+o)Uy,y] + llOUt.a + (M-<^)‘*o.i' ’ ’ 
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Here Xq, /xq ai'e the Lame moduli of the material boundary and o is the residual surface 
tension on the layer X 3 = 0. Following Gurtin & Murdoch (1976) we use the surface 
elasticity tensor. The forces on the bounding surface are governed by surface stress tensor 
Ylia- Given a smooth oriented curve y in ^^3 = 0 with positive unit normal Ui, Ylia is 
the force per unit length of y by that part of the boundary into which ria is directed, upon 
that part of X 3 =0 away from which ria points. Dimensions of Xq, /xq, cr ai'e the same 
(N/m). 

The boundary conditions are 


(i) 


T/3 


+ E 

icx,a 


02 


Ui 


-PQ- 


9r2 


= T 




on X 3 = 0 , 


where po is the mass per unit surface area (kg m“2) of the layer (Gurtin & Murdoch 1976), 
and tij and are the stress tensors in the interior of the media M] and M 2 respectively. 
The conventional stress tensor r /3 has the unit of force per unit area and the new stress 
tensor has the unit of force per unit length, r/y obeys the law (Gurtin & Murdoch 
1976), 


Zlj _ X^ijUk^k + + My^j)- 

The boundary conditions are 


(13) 


/X 2 


d^P d^Q d^Q 


9;ci9a:3 


+ 


dxj 


+ 


92 

(Aq + 2 pto)~^ 
dxf 


PO 


9f2 


/ 9P 

\9x] 


^ 92 ?' d~Q' 

2—4- ^ 



8 ^' 


IX s^p I 


PO 


9^2 


V9jc3 dxi 


a^Q’ d^p'\ , ix' a^p' ( 9 ^ 


P 


y9xi9x3 

92^2 

dx] 


9x; 


y 2 3^2 


9x2 




9x3 


(14) 


M 2 


(ii) The displacement components must be continuous on the surface of separation X 3 
at all places and times. Therefore 


9P dQ _ dP' dQ' 

9xi 9x3 9-^1 9x3' 

^_^^_9P' dQ' 
9x3 9xi 9x3 9xi’ 

M 2 = M^. 


(14a) 



Applying boundary conditions (14) and (14a) to (8) and (10) the following system of 
equations is obtained, 

(2m 1 + r]F)Ai — i{2- — 4- r}m 2 F)A 2 

4— —m\A\-{-i — (2 — s'^)A2 = 0, (15) 

IX IX 

(2 — 4- rjm\H)A\ — i(2m2 4- rjH)A 2 

- —(2 - j'^)A; - n—m'.A'y = 0, 

IX IX 

AI — im2A2 — A\ — /m2^2 “ 
m 1 A 1 — / A 2 4- m j A\ -\-iA 2 — 0, 

(iXQ- poC^ 4- -m 2 ) A 3 4- —m 2 A 3 = 0, 

V r] J V 

A 3 - A '3 = 0, 

where we have taken 

1 . 

-(Xq + 2/20 - PQC ) = F, 

^ (16) 

-{a -poc^) = H. 

The last two equations of (15) yield 
A 3 = A '3 = 0 , 

which implies that there is no propagation of displacement U 2 - Eliminating the constants 
Ai, A 2 , Aj, A 2 from (15) we obtain 


where 


A = det[aj;] = 0, z, 7 = 1, 2, 3,4, 


an = 2m\ 4- r]F, a \2 = 2-s^-\- rim 2 F, 

I F /r) / 2 \ 

<213 = 2—m,,<2i4 = —{Z — s ), 

IX II 

a2i=2- + rjmiH, <222 = 2m2 4- 

f' .0 ./ 2 n 

^23 — (2 ^ 

IX 

iF' f 

a24 = "2—m 2 , 

IX 

asi = 1, a32 = m 2 , <333 = —1, <^34 = —m^, 
041 = mi, a42 = 1, ^43 = m'l, a44 = 1. 


Equation (17) represents the wave velocity dispersion equation for interface waves in 
elastic solid media under the influence of surface stresses, where the plane of separation 
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X 3 = 0 is a material boundary. The equation depends on the particular value of rj which 
indicates dispersive nature caused by the presence of surface stresses. If the surface of 
separation is free of surface stresses, F = H = 0; and at once the classical equation of 
general surface waves is obtained from (17). 


4. Particular cases 


4.1 Rayleigh waves 


As a particular case of the general surface wave discussed in the previous article we may 
study Rayleigh waves under the influence of surface stresses in the light of the investigation 
done by Chandrasekharaiah (1987). Here the upper medium (M 2 ) is replaced by vacuum. 
Applying the boundary conditions 


d^u\ 


^13+E-/’0^=0. 


la,O' 


1^33+ 2 ^ -PO-^=0, 
3a,a 


we get 

(2mi + r]F)A\ + (2 - 5 ^ 4- r)m 2 F)A 2 = 0, 
(2 — 5 ^ + r}miH)A\ + (2m2 + r}H)A2 = 0. 


Ehminating the constants Ai, A 2 the Rayleigh wave velocity equation may be obtained as 


2mi + rjF 
2 — 5 ^ + rjmiH 


2 — + r]m2H 

11712 + VH 


= 0 . 


The determinantal equation on simplification gives 

(1 — m\m 2 )FHt]^ + {m\H 4- m 2 F)s^r} + 4mim2 — (2 — s^)^ — 0. (18) 


This equation is in agreement with the corresponding equation obtained by Chandrasekhara¬ 
iah (1987) in the absence of voids. Again in the case of conventional stress-free boundary, 
(18) becomes 


4mim2 — (2 — j'^)^ = 0. (19) 

When surface stresses Ylia ^^e from surface tension, we have as a special case the 
conditions Xq = /hq = po = 0. In such a situation F — 0, H = o jpu and hence the 
equation (18) becomes 

rids^il - = )Lt[(2 - - 4(1 - 5^)^/^(l - 


( 20 ) 
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where 

r^ = b^la^. ( 21 ) 

This equation is in agreement with the corresponding equation obtained by 
Chandrasekharaiah (1987) in the absence of voids. In the absence of residual surface 
tension, we take a = 0 to obtain the classical Rayleigh wave velocity equation (19). In 
this case, (20) has one root = 5 o in the open interval (0,1). If o' 0, (20) has a real root 
in the interval (^o, 1), provided rj < rjc where rjc = /^/[(j(l — Tq)^/^]. Again if 77 = ijc 
we see that = 1 is a root of (20). Lastly, the said equation has no real root if rj > rjc. 
Thus we have a critical wavelength, Lc = 27Tlrjc, below which Rayleigh waves cannot 
propagate at all under the influence of surface stresses. The relation, 

i(il2jt)Lc = o-(l - {fp- 

reveals that critical wavelength varies directly with residual surface tension which is a 
factor of surface stress. 

4.2 Love waves 

In order to consider Love waves we shall assume that the upper medium (M 2 ) is of finite 
thickness H and the lower medium (Mi) is semi-infinite. Let the origin O of the orthogonal 
cartesian frame (9;r 1 x 2 x 3 be on the interface, Ox{ and ( 9 x 3 are along the interface and 
vertically downwards into Mi respectively. For Love wave propagation along xi axis (see 
figure 1 ), 


Ml = W3 = 0, 

U2 = U2{x\,X'i, t). 


Here equations of motion are 



^ 2 = 0 . 


( 22 ) 



Figure 1, 


Love waves and their propagation. 


me soiuiiunii ui cue 


112 = A \ exp(—??m 2 Jr 3 ) 

ii'^ = {A 2 exp(—^?m 2 X 3 ) + A 2 &xp{rjm 2 X 2 )} 


where 


m2 = (l -5^)'/^, 

mi = (l 

^ C 

C = ^, ^ = -, 

rj o 


C 


The boundary conditions are: 

(i) ii 2 is continuous on the boundary surface of separation a :3 = 0 ; 

(ii) T 23 is continuous on the boundary surface of separation X 3 = 0 ; 
(hi) since the boundary surface X 3 = - // is free of external loads 


(23) 


(24) 


we have (Gurtin & Murdoch 1974-75) 

. ^ 

2a,a 

Applying the above boundary conditions to (23) and then eliminating the constants A 1 , 
A 2 , A 3 we get the wave velocity equation for Love waves under the influence of surface 
stresses in determinantal foim as, 


1 

lJim2 

ix'm'n 2 

-^ + (mo - Poc ) 

V 


-1 

~ 11 '1712 


ix'm2 


lJi'm2 


- (MO - Poc“) e ^'^2^ 


0 . 


This determinantal equation on simplification gives 


r]H = 


1 

Cic/b')^ - 1 }V 2 


X tan ^ 


KcVfc'^) - + (ri/n'Kixp - PQC^)) 

(c2/i'2) _ 1 _ (ix/ix')ll - (c2/fe2)]l/2 . (ri/^')(^o - 


(25) 


when the surface stress effect is neglected (/ro = Po = 0), (25) reduces to 


{(c2/(.'2) - 1)1/2 


{(c2/fc'2) - 1)1/2 


(26) 


which is the classical Love wave velocity equation. From (25) we observe that Love wave 
propagation is possible if 


1 < {c/b') < [(m/p)/(m 7 p')]^^^- 


(27) 
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Denotes the classical case where the surface stresses are not present 


4.3 Numerical calculation of Love waves 
For numerical calculations we take 

/X = 3.00 X 10^ N/cm^ = 5.00 x 10^ N/cm^ /xq = 6.47 x 10^ N/cm, 
p = 2.72g/cm^, p'= 9.89g/cm^, po = 3.40g/cm^. 

Table 1 gives the values of rjH for different values of C jh' taking rj = 1, 2, 3. It also gives 
the values of r)H for different values of C/b' in the absence of surface stresses (denoted 
by 0 in the first column). 

We observe that the Love wave velocity equation is independent of a and depends solely 
upon /xo and po. It is also observed that for a particular value of r], wave velocity decreases 
with increase of i]H. Again for a particular wave velocity, r]H increases with increase of 
rj. For comparison we give the values of r)H in table 1 for the classical case (denoted by 
0 ) where the surface sti'esses are not present. 


4.4 Stoneley waves 

Stoneley waves are a generalized form of Rayleigh waves propagating along the common 
boundary of Mi and M 2 . Equation (17) represents the wave velocity equation for Stoneley 
waves in elastic media with surface stresses. This equation after a little manipulation may 
be obtained in the form 

+ 5^ |(Cr + Z)S)(1 - Sff') + ^x^(CT' + DS')a - 5r) j ?7 - A(s) = 0, (28) 
where 

A(.s) = (1 - S'T') (2 - s^f - 4ST - 4—(1 - s^ - ST) 

+ (1 - ST) {(2 - T^s^)^ - 4S'ff - 4— - (1 - r^s^ - S'ff) 

I l^ 

-r|/ —(2 + 5r' + 5'7’), 

/X 


(29) 
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and 

S=clb, C—(T*/pb^, (T* = <7 — , 

D = r* = r — pQC^, r = lo + 2 ^ 10 , 

S' = {\- T' = {l- T3V)^/2, 

x\ =b/a, X 2 — b/b', rs = (30) 

For the classical case, i.e., when surface stress is absent, we take cr = F = po = 0, so 
that C = D = 0, (28) reduces to A(s) =0 which is equivalent to the classical Stoneley 
wave velocity equation. It is observed that the Stoneley waves under the influence of 
surface stress are dispersive in contrast with the classical situation for which waves of all 
frequencies propagate uniformly at the same speed. The equation A( 5 ) =0 was solved 
numerically by Koppe (1948) who concluded that the velocity of the interface wave falls 
between the values of the velocity of Rayleigh waves and of the transverse waves in the 
medium with greater acoustic density. Taking /x' 0 in (28) we get the Rayleigh wave 

velocity equation under the influence of surface stress as 

p'^CDil - ST) 4- s^iCT -h DS)r) - gis) = 0, (32) 


where 


g(s) = (2-s^)-4ST. (33) 

This result is in agreement with the result obtained in (18) and also that by Murdoch (1976). 
Further, when there is no surface stress, this equation reduces to the classical Rayleigh 
wave velocity equation g( 5 ) = 0. This equation has one real root s = iri the interval 
(0,1). If.? = satisfies the equation A(s) = 0, then 50 < transverse wave 

velocity in the medium with greater acoustical density. From (28) when CD 7 ^ 0 we have 


—s 


± 


(cr + D5)(l - S'T') + —xliCT' + D5')(l - ST) 




11' 

P' Jii 


{CT + DS)a - S'T') + —x^iCT' + DS')(l - ST) 


P 


+ 4A(^)CD(1-6'r)(l-5'r) 


1/2 


-^{2CD(i-^r)(i-5''rO}. 


(34) 


If C = 0, i.e. in the absence of residual surface tension for a specific value of s we have 
from (28) 
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Again when D = 0 i.e. in the absence of surface tension at some value of 5 we have 


rj = A (s) 


CT(1 - S'T') + —r^CT'a - ST) 


(36) 


If both C and D vanish at some value s', we have A(50 = 0 so that Stoneley waves exist. 
Otherwise there is no propagation of Stoneley waves. The case when cj, F > pob^, C and 
D will remain positive in ( 0 , 1 ). Since rj is positive, C < b, t\, Z2, t3 < 1 we discard 
the solution of ( 34 ) which is negative for some 5' in ( 0 , 1 ). Thus we see that Stoneley 
wave velocity under the influence of surface stresses may be less than or greater than 
the classical Stoneley wave velocity but always remains greater than the Rayleigh wave 
velocity. Moreover, in the case when 0 < <j < F < pob^, C and D will vanish exactly 
at one point {each on ( 0 , 1 )}, say 52 and 53 on (0, 1), so that s^ = ajpff?-, S3 = T jprffp- 
and hence 0 < S2 < S3 < 1 , where C is positive on ( 0 , si) and negative on (si, 1 ) and D 
is positive on (O, S2) and negative on (S2, 1 ) Again when cr, F lie in ( 0 , pob^), CT -\- DS 
and CT' + DS' vanish at say S4 and S5 in (S2, S3). We observe that CT + DS is positive 
on ( 0 , S4) and negative on (s4, 1 ) while CT' + DS' is positive on ( 0 , S5) and negative on 
(S5, 1). Thus the number of positive values of rj which satisfy (28) for a particular value 
of s depends upon the location of s\ related to S2, S3, S4, and S5, as well as on the value of 
2(s) and zeros of 2(s) where 


Q(s) = s^ 


{CT + DS){\ - S'T') + —xliCT' + DS'){\ - ST) 




+ 4 A(s)CD(l ~ ST)(1 - S'T') 


. (37) 


Further discussion in the light of Murdoch (1976) is not pursued in this paper due to its 
complicated nature and cumbersome calculation. 


5. Conclusion 

Stoneley waves under the influence of surface stresses may be discussed in a manner 
similar to Rayleigh waves. It is seen that the propagation of Stoneley waves under the 
influence of surface stresses is not always possible. Under certain restricted conditions 
such a propagation is possible. Moreover, Stoneley wave velocity under the influence of 
surface stresses may be less or greater than the classical Stoneley wave velocity, but always 
remains greater than the Rayleigh wave velocity. We point out here that Stoneley waves are 
dispersive in character in contrast with the classical situation. Rayleigh waves have been 
deduced as a particular case of Stoneley waves. In case of Love waves we have shown that 
the wave velocity equation does not depend on residual surface tension. Some numerical 
calculations highlight the effect of surface stresses on Love wave as described in the paper. 
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Recent Advances in Power Electronics and Drives 


Foreword 

Due to availability of self-commutated devices like power MOSFET, IGBT, and GTO, 
power electronics has undergone substantial change in recent years. It is used now in a 
very large number of applications. Some of them are: VAR compensators, active power 
filters, switch mode rectifiers, space and uninterruptible power supplies, industrial fans, 
blowers, compressors, steel rolling mills, paper and textile machines, mine winders, cement 
mills, electric vehicles, electric and diesel electric traction, high voltage dc transmission, 
induction heating, electrolysis, excavators, ship propulsion, missiles, satellite tracking 
stations, machine tools, robots, lasers, domestic appliances, defence electronics, flexible 
ac transmission etc. 

While the applications of power electronics are growing, the main emphasis of R & D 
has shifted to improvement of efficiency and performance, reliability, and power qual¬ 
ity, reduction in volume and weight, and in maintenance and down-time, longer life, 
fast/optimal response and mitigation of problems associated with harmonics and poor 
power factor. 

This issue of Sddhand aims at making the state-of-the-art reviews of the emerging topics 
available to its readers. Papers in the issue have been contributed by eminent researchers 
who are internationally known for their contributions in the respective areas. 

Performance and efficiency of converters, and associated drives and power systems, 
depend to a great extent on the modulation technique. Because of simple digital realisation, 
flexibility in adaptation to various applications and good performance, space vector pulse 
width modulation has become quite popular. V T Ranganathan provides a status review of 
space vector pulse-width modulation in the first article. 

Simulation permits detailed analysis leading to deeper insight into the behaviour of 
power electronic systems. Consequently one can select (or design) optimal parameters 
and control techniques, and then proceed to build the systems. Therefore, the accepted 
practice today is to first carry out detailed simulation study using the simulation packages 
developed particularly for power converters. The paper by M Dawande, Victor Donescu, 
Z Yao and V Rajagopalan reports the recent advances in simulation of power electronic 
converter systems. 

Static VAR compensators have been for many years an essential component in the 
operation of power transmission and distribution systems. While in transmission systems 
they support the fine voltage and stabilise the system, in the case of distribution systems, 
they allow improvement of power factor and provide support to line voltage. Availability 
of self-commutated devices has led to a new generation of VAR compensators known as 
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Modem VAR compensators or Synchronous Link Converter Based VAR Compensators 
or simply Static Compensators (STATCOM). They have the advantages of compact size, 
high efficiency, low harmonic content and stepless control of VARs from full lagging to 
full leading. The paper by Geza Joos on recent advances in VAR compensators covers 
compensators based on current and voltage source converters and on ac controllers, both 
in shunt and series configuration. 

The combined capacity of loads which pollute the power system with harmonics has 
reached intolerable levels, making it necessary to take measures to reduce the pollution 
to an acceptable level so that the quality of power supply can be preserved. Several of 
these loads consist of systems with thyristor-based converters as front-end converters, 
which generate harmonics and operate at low power factor. This problem of power system 
pollution is being tackled by two measures. First, a new generation of converters is being 
developed which will operate at unity power factor and draw sinusoidal current from the 
mains. This will substantially eliminate the poOution caused by converters. The second 
measure consists of evolution of active power filters to filter out harmonics. Unlike passive 
power filters, active power filters have compact size, higher efficiency, lower cost and better 
ability to filter harmonics. These issues of great concern to the present day power system 
have been tackled in three papers. The paper by Ned Mohan and G R Kamath on the recent 
advances in active power filters describes some unity power factor and sinusoidal current 
converters and the principle of active power filters. The advantages of resonant converters 
are well established. The paper by A K S Bhat on high power factor operation of resonant 
converters presents the operation and characteristics of series-parallel (LCC-type) resonant 
converters operating in discontinuous and continuous current modes with high power factor 
and low harmonic current distortion for wide variation of load. The review of single phase 
power factor correction by Ramesh Oruganti and Ramesh Srinivasan covers 1-phase ac- 
dc converters capable of operating at a high power factor and with harmonic distortion in 
source current within acceptable limits. 

The desire to transmit more power through a given transmission and distribution net¬ 
work and at the same time maintain the quality of supply has led to the development 
of a new area of research known as flexible ac transmission system. It employs power 
electronic converters. It is projected that by 2010 around 80% of electric power will flow 
through power electronic converters. K R Padiyar and A M Kulkami present a status 
review of flexible ac transmission systems in their paper and provide a generalised de¬ 
scription of FACT (flexible ac transmission) controllers and a review of progress in FACT 
systems. 

Another major application of power electronics is in the control of electric drives. In sev¬ 
eral applications, electric drives must have a fast torque response, four quadrant operation 
capability and controllability of torque and speed over a wide range of operating condi¬ 
tions. Traditionally dc motor drives have been used in these applications; ac motor drives 
have several advantages over dc drives. With vector control they are able to fulfill these 
requirements very well, hence vector controlled ac drives find wide application. Though, 
vector control by now has gained maturity it continues to be a topic of great interest to 
researchers. The review paper by A K Chattopadhyay discusses advances in vector control 
of ac motor drives and describes various aspects of vector control of induction motor and 
synchronous motor drives. 
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Switched reluctance motors have several advantages over induction and synchronous 
motors. Prominent among these is simple and robust constmction. Switched reluctance 
motor drives are therefore projected to be cheaper and more sturdy. Several problems are 
still to be solved before a commercially acceptable general purpose drive can be developed. 
The review of recent advances in switched reluctance motor drives by M Ehsani discusses 
various aspects of switched reluctance motor drives. 

Due to the advance in magnetic material technology and control algorithms, the appli¬ 
cations of permanent magnet brushless dc motors are growing at a fast rate, particularly in 
low power applications. The paper by Bhim Singh discusses recent advances in permanent 
magnet brushless dc motors both with rectangular and sinusoidal fields. 

The Indian Railways are in the process of modernising locomotives and EMU drives. 
Their ultimate aim is to replace dc drives by ac motor drives. Being a specialised topic, 
literature covering all the aspects is not easily available. In their status review on ac motor 
traction drives L Frederick and G K Dubey cover ac motor traction drives. 

It is hoped that this issue of Sddhand will provide its readers with a good exposure to 
recent advances in several topics of current research interest. 

On behalf of the Indian Academy of Sciences and on my own behalf, I wish to express 
sincere thanks to all contributors to this issue. 

I also wish to record my sincere thanks to Prof. N Viswanadhan, editor of Sddhand, 
for giving me an opportunity to edit this special issue. I also thank Prof. M A Pai of the 
University of Illinois for encouragement and help. 

Ms. Shashikala and her team at the Academy also deserve appreciation for working with 
interest and enthusiasm to bring out the issue in good shape. 


December 1997 


GOPAL K DUBEY 
Guest Editor 




Space vector pulsewidth modulation - A status review 


V T RANGANATHAN 

Department of Electrical Engineering, Indian Institute of Science, 
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e-mail: vtran@ee.iisc.emet.in 

Abstract. The technique of space vector pulsewidth modulation (SVM) is 
reviewed. The basic principle of SVM is derived and is compared with sine- 
triangle PWM. Operation in the overmodulation range is explained. Extension 
of SVM to other inverter-motor combinations such as three level inverters and 
split phase motors are discussed. 

Keywords. Pulsewidth modulation (PWM); space vectors; space vector mod¬ 
ulation (SVM). 


1, Introduction 

A large number of variable speed induction motor drives are in use today in applications 
as varied as pumps and fans, traction, machine tool spindles, printing presses etc. The vast 
majority of them are based on voltage source inverters, although other circuit configurations 
are possible. This is because the developments in power device technology have resulted 
in the availability of a number of power switches, such as MOSFET, IGBT and GTO, 
which meet every kind of power rating, from a few hundred watts to megawatts. Voltage 
source inverters, as is well known, produce pulsed output waveforms, consisting of the 
fundamental component which is required to drive the motor, as well as the harmonics 
which contribute only to the losses and torque pulsations. The method of determining 
the widths and the sequence of the voltage pulses produced by the inverter is known as 
Pulsewidth Modulation (PWM). 

The basic attempt in all Pulsewidth Modulation techniques is to produce the required 
amplitude and frequency of the fundamental voltage necessary for driving the motor, 
while moving the energy in the harmonics to a higher range in the frequency spectrum. 

^ The impedance presented by the motor to these harmonic voltages consists mainly of 
the leakage reactance. The expectation is that at such higher frequencies, even the reac¬ 
tance of the machine leakagd inductance will be appreciable, thereby limiting the har¬ 
monic currents drawn from the inverter. Also, since the torque pulsations created by the 
harmonic currents will also be at a higher frequency, the motor should be able to run 
smoothly. 
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The constraint on the PWM process is the fact that additional switchings per cycle are 
required in order to accomplish the modulation. This increases the losses in the inverter. 
Moreover, as switching patterns become more complicated, an individual inverter phase 
may be required to produce pulses or notches of very small width. The swiching times of 
the power devices used in the inverter impose a limitation here. 

Several popular PWM techniques have emerged over the years. The earliest one was 
the sine triangle method (Schonung & Stemmier 1964), in which the switching instants of 
the inverter legs are decided by the points of intersection of a high frequency (anywhere 
from 450 hertz to a few kilohertz) triangular wave and a sinewave at the frequency of 
the required fundamental motor voltage. The triangle is referred to as the carrier and the 
sine as the reference. The output voltage produced by the inverter contains the funda¬ 
mental component, of which the sine wave is a replica. In addition, it contains harmonic 
components which are located in bands around multiples of the triangle frequency. It 
is, in general, necessary to synchronise the carrier to the sine wave, in order to avoid 
frequency components in the output below the sine frequency - so called subharmonic 
components. 

The sine triangle method of PWM, though used widely, has its limitations. The amplitude 
of the sine wave has to be increased in proportion to its frequency, in order to keep the motor 
flux constant. Therefore, at higher motor speeds, a situation arises where the amplitude 
of the sine wave becomes equal to that of the triangle. If the sine wave amplitude were 
to increase any further, the linearity between the sine and the fundamental component 
of the inverter output voltage is lost. This situation is referred to as overmodulation. A 
further consequence of overmodulation is that frequency components with low harmonic 
numbers (5th, 7th etc.) begin to appear in the output voltage. From the implementation 
point of view also, sine triangle PWM requires some care. Where the triangle frequency is 
limited by the switching speed of the power devices, analog realisations give the best results 
in terms of output waveform quality. However, analog realisations require considerable 
ingenuity, especially to achieve synchronisation (Kliman & Plunkett 1979; Green & Boys 
1982). 

Subsequently, other techniques such as Selected Harmonic Elimination (Pollmann 1983), 
which were more amenable to digital implementation, have also been used widely. Today, 
the trend being towards standard hardware based on digital processors, it is preferred to 
implement all PWM techniques through real time software execution of algorithms. A 
number of software realisations of the sine triangle technique have been reported in the 
literature. They are good approximations of the analog realisation if the frequency of the 
triangle is high (at least 1 kilohertz or above). The limitation of the output voltage due to 
overmodulation effects is, however, still a factor to be considered. 

The above techniques have one thing in common in that the PWM patterns are calcu¬ 
lated for each phase, independent of switchings in the other phases. It has subsequently 
been realised that good results can be obtained if the combined effect of all the three 
output voltages of the inverter on the motor are taken into account in generating the 
switching patterns. The conceptual framework for doing so is the space vector modelling 
of ac motors. In the following sections, the notion of space vectors is introduced and 
the technique of generating PWM patterns based on these viz. space vector PWM is 
reviewed. 




Figure 1 . Three-phase half-bridge voltage 
source inverter. 


2. Basic space vector PWM (Holtz et al 1987; Van der Broek et al 1988) 

2.1 Space vectors 

Consider a three-phase half-bridge voltage source inverter, shown in figure 1. Each phase 
to centre-tap voltage , viz. Vro^ can have only two possible values, namely 

+Vic/2 or — Vic/2 respectively. As there are three switches, corresponding to the three 
phases, there are eight possible states for the inverter at any instant of time. Corresponding 
to each of the switching states, the motor line to neutral voltages can be determined using 
the following equations, 

V,i=Vrn = (1/3)(Vry-Vbr), 

V,i=(l/3)(2yRo - Vyo - Vbo), 

V52=(1/3)(2Vyo - Vbo - Vro), ^ ^ 

V.3=(1/3)(2Vbo - Vro - Vyo). 

From these three-phase voltages, equivalent two-phase voltages are defined as follows. 


Vsa^O/2)VsU 

V,i = (V3/2)(V,2- Vrf). (2) 
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Figure 2. Eight switching states of the 
inverter. 
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Figure 3. Space vector corresponding to each switch¬ 
ing state. 


The space vector of the machine stator voltage is then defined as 

Vs = Vsa + jVsb. (3) 

The eight switching states of the inverter and the corresponding space vectors of the 
machine stator voltage are shown in figures 2 and 3 respectively. 

It can be seen that at any instant of time, there are only eight possible positions for the 
voltage space vector. When the inverter is operated in the six-step mode, the switching 
states go through the sequence 1-2-3-4-5-6-1-2-3. The states 0 and 8 are not used at all. 
The space vector of stator voltage stays in each of the positions 1 to 6 for a time interval 
corresponding to 60° of the fundamental period and jumps to the next position at the end of 
every sixty degrees (one-sixth of the fundamental period). With above switching sequence, 
at every jump in the position of the voltage space vector, only one of the three phases of 
the inverter switches from top to bottom or vice versa. 

2.2 Space vector PWM 

The ideal trajectory for the voltage space vector is of course a circle described with uniform 
angular velocity, which results only when the motor is fed from a three-phase sinusoidal 
voltage source. The objective of any PWM process is therefore to approximate this ideal 
trajectory of the voltage space vector by switching amongst the eight standard positions. 
Towards this end, the continuously moving reference vector V* is sampled at a sampling 
frequency fg. During the interval Tg = l/fg between samples, the reference vector is 
assumed to remain constant. It is clear that for this assumption to be valid, the sampling 
frequncy should be fairly high compared to the fundamental output frequency desired from 
the inverter. 

Now consider that at a particular sampling instant, the reference V* is situated in sector 
I as shown in figure 4. The angle a represents the position of the reference vector with, 
respect to the beginning of the sector. It is intuitively clear that the reference vector can be 
reproduced best during the period till the next sample by switching the inverter to create 
the vectors Vi, V 2 , Vo and V 7 in some sequence. Selecting any of the other vectors would, 
result in a greater deviation of the actual vector from the desired reference and would thus 
contribute to harmonics. 
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Figure 4. Sampled reference vector located in sector I. 


The switching pattern can be calculated as follows. Assume that the sampling period Ty 
is divided into three subintervals T \, T 2 and To* The inverter is switched so as to produce 
the vector Vi for T\ seconds, V 2 for T 2 seconds and zero (either Vo or V?) for Tq seconds. 
The subintervals have to be calculated so that the volt-seconds produced by these vectors 
along the a and b axes are the same as those produced by the desired reference vector V* 
i.e. 

Vdc • Ti -f- Vdc • cos 60 • 72 = IV* I • cos a • jTj • • • a axis, (4) 

Vdc • sin 60 • T 2 = IV*! • sin a • 7^ • • • axis, (5) 

where |V*1 is the amplitude or the length of the reference vector. Define the amplitude 
ratio 


« = |V,*1/Vdc. 

(6) 

Then (4) and (5) can be rewritten as. 

7i -f 72 • cos 60 = a Ts • cos a. 

(7) 

T 2 ■ sin 60 = a • Ts ' sin a. 

(8) 

Solving for 7i and 72, 

Ti=Ts • a ■ sin(60 — a)/ sin 60, 

(9) 

72 = 7r • a • sin a/ sin 60, 

(10) 

II 

1 

1 

(11) 


As the reference vector moves to other sectors, the corresponding boundary vectors of the 
sector should be created during the intervals T\ and T 2 . 

In order to minimise the number of switchings in the inverter, it is desirable that switching 
should take place in one phase of the inverter only for transition from one state to another. 
This can be achieved if the following switching sequence is used: 

0-1-2-71-7-2-1-01-0-1-2... . 

Therefore the zero interval is divided into two equal halves of length Tq/2. These half- 
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Figure 5. Inverter phase to centre tap 
voltages. 


sequence during successive sampling intervals will then be as shown in figure 5. It can be 
seen that the chopping frequency fc of each phase of the inverter is given by: 

fc = fs/2- (12) 

It should be pointed out that during the sampling interval, the desired reference vector is 
approximated in the average sense, since the volt-seconds are equated. However, instanta¬ 
neously, the actual vectors produced by the inverter are different from the reference vector 
and therefore instantaneous voltage deviations or voltage ‘ripple’ exists. As a result har¬ 
monic currents will also flow in the machine. By following the above mentioned switching 
sequence, harmonics are to some extent reduced. 

2.3 Interpretation in terms of sine-triangle PWM 

In order to compare the space vector PWM with sine-triangle PWM, the mean value of the 
inverter phase to the dc centre tap voltages are calculated first, over one full cycle of the 
fundamental i.e. for one full rotation of the reference vector V*. Let r = 0 correspond to 
the instant where the average value Vrq of the phase to centre tap voltage goes through its 
positive zero crossing. At this instant, therefore, the reference vector will point vertically 
downwards as shown in figure 6. From this initial position, the reference vector rotates 
in the anticlockwise direction with angular velocity o )\. Its position at any instant t with 
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Figure 6. Reference time instant for calculating 
Vro 


respect to the starting position is given by the angle 0 — coit. For 0 < 6 < 30°, the 
reference vector is in sector V; for 30 <0 < 90°, the reference vector lies in sector VI. If 
the variation of Vro can be determined over these 90°, then its waveform over the rest of 
the cycle can be drawn using symmetry. 

For 0 < 0 < 30°, Vro = (V/c/2)(l/r,)[-ri + T 2 ]. (13) 

Further a = (30 + ^)°. Using the expressions (9) and (10) for T\ and T 2 , it can be shown 
that 


Vro = Vtc • ^ • sin0 = Vdc • ^ • sincuif. (14) 

For 30 < ^ < 90°, Vro = (W2)(l/7;)[-ri + T 2 I (15) 

However, a = 0 — 30°. Again, using (9) and (10) in (15), it can be shown that 

V-RO = (Vdc ■ (1/V3) ■ a ■ Sin(30 + ®if). (16) 

Using the expressions in (14) and (16) and taking advantage of symmetry, the variation 
of Vro over one full cycle of the fundamental can be drawn. The waveforms for the other 
two phases will be identical with phase shifts of 120°. The mean value of the line voltages 
can also then be drawn by taking the differences. Figure 7 below shows the variations over 
the positive half cycle of phase R. 



From the figure, it can be seen that the waveform of the mean value of the phase to 
centre tap voltage is not sinusoidal but contains triplen harmonics. These harmonics get 
cancelled in the line to line voltages, which are therefore sinusoidal. The machine line 
to neutral voltages will also be sinusoidal. Considering the waveform of the line to line 
voltage in figure 7, 

Peak value of mean line to line voltage = (2/\/3) • a • Vdc- (17) 

Recalling the definition of the voltage control ratio given by (6), the maximum value of a 
is given by 

<2max = radius of the lai'gest circle inscribed in the hexagon 4- Vdc- (18) 

Therefore, 

amax = (V3/2). (19) 

Substituting in (17), the maximum peak line to line voltage using SVM 

= (2/V3) • (V3/2) ■ Vic = Vdc- (20) 

The corresponding value for sine-triangle modulation is given by (V)/c/2) • Vj. It can 
therefore be concluded that SVM can produce 15 % more fundamental compared to the sine- 
triangle method without going into over-modulation. SVM can be regarded as a carrier- 
based technique, with the modification that the reference waveforms have triple harmonics 
added to them. In fact, modulators have been designed with a triangular waveform at three 
times the sine frequency being added to the sine references. 

2.4 Comparison with sine-triangle PWM in terms of waveform quality 

In general, comparison of the waveforms produced by different PWM techniques is carried 
out in terms of the ‘loss factor’ (Handley & Boys 1992; Holtz 1994). These are measures 
of the total rms harmonic currents flowing in the motor, usually normalised with respect 
to the harmonics produced by a six-step inverter (Holtz 1994). A comparison of SVM 
with sine-triagle in this respect shows that SVM produces less current harmonics than 
sine-triangle at higher modulation indices. At low modulation indices (< 0.4), there is no 
marked superiority of one technique over the other. 

3. Overmodulation 

As the fundamental output frequency of the inverter increases, and with it the amplitude of 
the reference space vector V* , the duration of time tQ to be spent at the zero vector becomes 
smaller and smaller. When the locus of the reference vector becomes the circle inscribed 
within the hexagon, to becomes zero. This is spoken of as the limit of modulation. For 
further increase in the length of V* , the locus lies partly outside the hexagon, implying that 
for some of the samples, the reference vector cannot be approximated in the volt-second 
sense by the SVM technique, even if the duration of fo is made zero. Different policies can 
be adopted to generate the pulse patterns for such loci. One technique is to apply equations 
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Figure 8. Average phase to centre tap voltage for overmodulation. 


(9) to (11) for those a values for which the locus lies inside the hexagon. For other values 
of O', the circular locus is forsaken in favor of the hexagon. This can be achieved by setting: 


= t]/(ti + 12); 


to=0. 


, \/3 cos a — sin O' 

i.e. ?! = (Ts) —= -^—, 

V 3 cos a -f sin q; 



( 21 ) 


This procedure scales down the subintervals of the non-zero vectors so that they add up to 
the sampling interval Ts , while setting the subinterval for the zero vector to zero. Therefore, 
there is an error in the amplitude of the motor voltage for such samples. The limit of this 
process is reached when the circular locus of the reference circumscribes the hexagon. 
The average phase voltage waveform of the motor will then have the waveform shown in 
figure 8. The amplitude of the fundamental phase voltage is 


9 


7t 


2 



( 22 ) 


The voltage of figure 8 contains low order harmonics such as the 5th and the 7th. 


4. Modified SVM or bus clamped SVM 

In conventional SVM, there are three switchings in the inverter in each sampling interval 
e.g. Vo, Vi, V2 and V7. If it is desired to reduce the number of switchings, an alternative 
sequence such as V 0 -V 1 -V 2 IV 2 -V 1 -V 0 can be thought of. Here the entire time for the zero 
vector is spent only at Vq. Compared to the conventional SVM, the number of switchings 
in the inverter becoms 2/3 in this case. The effect of such a switching strategy can be 
analysed along lines similar to that in § 2.1. The average pole voltage waveform can be 
shown to have a time variation of the form indicated in figure 9. This process can therefore 



be considered conceptually equal to a sine-triangle PWM , where a rectangular waveform 
at three times the sine frequency is added to each of the sine waves. Since one of the inverter 
phases is tied to a bus over 60° intervals, this technique has also been referred to as Bus 
Clamped SVM. This technique has the advantage that the lowest order harmonic generated 
is at a frequency 50% higher than that of normal SVM with the same switching rate. In 
addition, since there are only four swichings over a PWM period, there is less overhead 
in terms of loading timers, when it comes to a question of practical realisation. However, 
the performance in terms of harmonic content or loss factor is inferior to conventional 
SVM. 

5. Extensions of space vector modulation 

The majority of drives use a single inverter with a normal three phase ac motor. In high 
power drives, however, in order to limit the voltage and current ratings of the power devices, 
other configurations are also in use. Notable amongst these are: (i) the three-level inverter 
(Nabae etal 1981; Stemmier 1994) and (ii) the splitphase induction motor drive (Andresen 
1981; Gopakumar et al 1993). These configurations extend the power rating of the system 
by using double the number of switches i.e.twelve. Space vector concepts can be extended 
to such configurations also in order to generate the pulse patterns, as outlined below. 

5.1 SVM of three level inverter 

The configuration of a three level inverter is schematically indicated in figure 10. Each phase 
to centre tap voltage can now have three values viz.-\-Vdc/2, 0, — Vcic/2. By considering 
all the possible switching states and applying the definition of voltage space phasor in (3), 
it can be shown that the possible locations for the voltage space vector are as shown in 
figure 11. 

Although theoretically there are 3^ = 27 possible space vector locations, there are only 
18 distinct vectors in addition to the zero vector. Vectors 13 to 18 can each be realised by 
more than one switching state of the inverter. The zero vector can also be realised by three 
different switching states, namely (+ -|- 4-), (000), and (-). 


B 


Figure 10. Circuit diagram of three- 
level inverter. 






+ - + 
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Figure 11. Voltage space vector for three- 
level inverter. 

A strategy for generating the switching sequence can again be arrived at by looking 
at a 60° segment, as shown in figure 12 below (Kolenchery et al 1996). The sector can 
be divided into 4 triangles. Depending on the triangle in which the tip of the reference 
vector lies, the vectors and the associated switching states can be judiciously selected so 
as to approximate the reference in the volt-seconds sense as well as to keep the number of 
switchings in the inverter low. 

5.2 SVMfor split phase motors 

An alternative to the three-level inverter for high power drives is the split-phase motor 
(Andresen 1981; Gopakumar et al 1993). The split phase motor is obtained by splitting the 
phase belt of a conventional three-phase motor into two equal halves with aphase separation 
of 30° between the two halves (figure 13). The split phase groups are controlled by two 
inverters with a dc link voltage of Vdc/2 cos 15 each. Since the split phase configuration 




000 +00 +- - 

+++ 0- - Figure 12. 60° sector of space vector hexagon for three- 

— level inverter. 






(b) 


Figure 13. Configuration of split-phase induction motor drive, (a) Block diagram of split- 
phase induction motor drive, (b) Orientation of MMF vectors produced by the coils of a 
split-phase machine. 


is achieved by splitting the phase belts in two, the equivalent number of turns per phase 
for the new configuration is Ns/I cos 15, where Ns is the equivalent number of turns for 
the three-phase configuration. Correspondingly, a link voltage of Vdc/'2> cos 15 gives the 
same magnitude for the airgap flux in the new configuration as compared to the three phase 
inverter drive with slcIc link voltage of Vdc- 

The voltage space vector locations for the two sets of coils are shown in figure 14. The 
locations of one group are shifted by 30° from those of the other. There are 48 possible 
distinct space vector locations by combining the individual locations of the two groups. 
The boundary of all the locations is a 12-sided polygon, as compared to the hexagon for a 
single inverter. The reference vector can therefore be located in any one of the 12 sectors. 
One possible method of generating the pulse patterns for the switches is to approximate 
the reference vector by the 12 outermost vectors (Gopakumar et al 1993). The subintervals 
Ti,T 2 and Tq are then obtained as 

T\—2-a-Ts- sin(30 — a), 

T 2 = 2 • a • Ts • sina, (23) 

To = Ts-Ti-T2, 



Figure 14. Voltage space vector location 
for split-phase drive. 
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Figure 15. Average phase centre tap voltage 
for split-phase drive. 


where now a is the angle of the reference from the beginning of each 30° sector. By an 
anlysis similar to that in § 2.3, it can be shown that the average pole voltage has the time 
variation of figure 15. It can be seen that these contain harmonics such as the 5th and 7th. 
These harmonics, though they contribute to stator current, do not produce torque harmonics 
as their fields get cancelled out in the airgap because of the winding arrangement. With 
such a technique, it can be shown that an equivalent three-phase voltage of up to 0.643 
can be generated before entering into the overmodulation region. This can be compared to 
O.SllVcic for the normal three-phase system. 


6. Conclusion 

The concept of space phasors thus provides a unified framework for the generation of 
pulse patterns for inverters. It can be extended to configurations other than the simple 
three phase half-bridge inverter. Space phasor-based methods of PW M are also easily 
amenable to digital realisation. They can also be easily added on to advanced motor control 
algorithms such as vector control. Due to these reasons, such methods have become popular 
in industrial drives. Current control techniques such as hysteresis control can also be 
adapted for execution in the space phasor domain, although it has not been possible to 
present such methods here. 
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Abstract. This paper presents a comprehensive state-of-the-art of the recent 
advances in simulation of power electronic converter systems. Knowing the 
importance of simulation, this paper reviews the various methods of modelling, 
circuit analysis approaches, numerical techniques etc. Several general purpose 
simulators and dedicated power electronic simulators have been discussed. 
A few demonstrative examples of simulation of power electronic converters 
by using different simulators are provided. Practic.al difficulties in simulation, 
challenges, new developments and scope for future work are also discussed. 

Keywords. Power electronic converter; power electronic simulator. 


1. Introduction 

Power electronics has gone through intense technological evolution during the last three 
decades. Power electronic converter systems have captured a major place in it, as they find 
wide applications in industry. It has been reported in the recent IEEE PELS (Newsletter 
1997) that ‘Analysis and control of power converters’ is the area of highest interest (65%) 
in power electronics. At present, in research, design and development power electronic 
converter systems, modelling and simulation are widely used and became indispensable 
before practical hardware implementation. 

Design of power electronic converters requires fairly sophisticated analysis, simulation 
tools and detailed design procedures involving tolerance analysis and worst-case design 
before building the actual system. The computation of steady state solution is an essential 
first step in most of the design studies (Rajagopalan 1987). Computer-based design au¬ 
tomation tools are expected to have an increasing influence on improving the reliability of 
power electronic systems (Kang & Lavers 1994). Accurate, fast and cheaper simulation 
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Figure 1. Flow chart for design process. 


methods are required for (1) comprehensive system performance, (2) efficient designs, (3 ) 
discovering possible problems due to abnormal operating modes, (4) optimal design, (5) 
sensitive study based on the various parameters, (6) knowing the behavior of circuit at 
various levels, (7) predicting the interaction with other networks, and (8) avoiding destruc¬ 
tive laboratory tests etc. Therefore, the normal design process should follow the algorithm 
given in figure 1. It is always advisable to study the system thoroughly, to determine the in¬ 
fluence of all the parameters, to raise the maximum doubts for its worst-case operation and 
get the solutions through simulation, before finalizing the optimal design of power elec¬ 
tronic converters. Micro-modelling and macro-modelling of power semiconductor devices, 
new simulation tools and user-friendly environments are currently available for the sim¬ 
ulation of power electronic converters (Rajagopalan et al 1990, 1992, 1995; Rajagopalan 
1995). 

This paper gives a comprehensive state-of-art of the several simulation techniques em¬ 
ployed to the power converters. It describes the micro- and macro-modelling of power 
components, time and frequency domain analysis, different circuit analysis approaches, 
various numerical techniques used in simulation etc. Several general purpose simulators 
such as MATLAB-SIMULINK, SIMNON, EMTP, EMTDC, PSPICE, SABER etc. are 
reviewed. Several dedicated power electronic circuit simulators such as ATOSEC5, PSIM, 
SIMUPELS are briefly discussed. Computer-aided learning tools like CATPELS are also 
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mentioned. Some examples are demonstrated in different simulators. The paper discusses 
some of the practical difficulties in the simulation process and explores the challenges that 
are to be overcome to make simulation more efficient and attractive to design engineers. 
Finally, the future trend of systematic controller design, i.e. hardware implementation 
directly from digital simulation, is introduced. 


2. Simulation techniques 

The importance of simulation is wisely refeired to as ‘Question of questions’ (Kassakian 
1981) and many simulators have evolved after realizing its importance. General purpose 
simulators such as MATLAB-SIMULINK, SIMNON etc. are widely used in various en¬ 
gineering disciplines while PSPICE, SABER etc. are used especially in electrical circuit 
analysis. Simulators like EMTDC, EMTP are mostly used in power system area and a few 
circuit simulators such as ATOSEC, PSIM, SPICE, SIMUPELS, KREAN are dedicated 
to power electronic converter systems. 

2.1 Micro- and macro-modelling 

All currently available power electronic simulators using either software packages such 
as EMTP, PSPICE, SIMNON and MATLAB-SIMULINK or dedicated power electronic 
simulators such as ATOSEC5, and PSIM use a macro-modelling approach for modelling 
power semiconductor elements. The micro-model used for the semiconductor switch can 
be any one of the following classes. 

(i) Ideal; 

(ii) two-valued resistor; 

(iii) two-valued inductor; 

(iv) subcircuits; 

(v) equations describing switch. 

The ideal model gives a simple idea of short circuit when the switch is ‘on’ and open 
circuit when the switch is ‘off’ (Lai 1994). Two-valued resistor (Kang & Lavers 1994) 
and two-valued inductor (Rajagopalan 1988) models implement the low impedance in the 
‘on’-state and high impedance in the ‘off’-state. A number of sources, resistors, and capac¬ 
itors make the switch model with subcircuits, while the nonlinear equation, representing 
the switch is used in the equation switch model. This modelling approach has been used 
and validated by several researchers for the analysis of large power electronic convert¬ 
ers. It is useful to develop a library of models for the various components of the power 
electronic system, namely, power semiconductor devices, electrical machines, power elec¬ 
tronic converters and power electronic control strategies. Such a library of models must be 
available in a user-friendly environment which can be advantageously used for building 
power electronic converter circuits and systems for analysis. These user-friendly models 
are nnw available fnr varinns simnlarnrs snrh as .SARF.R P.STM and STMTIPRLS etc. 
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2.2 Time /frequency domain 

Simulation techniques can be based on the time domain or the frequency domain principle. 
To study the switching operation effect on power converters or high switching frequency 
operated converters time domain analysis is usually used (Liu et al 1993; Tymerski 1993). 
Analysis, design, steady state or transient state conditions of the controller or power elec¬ 
tronic converter are usually studied by frequency domain based simulation techniques 
(Maranesi 1992). It is not always easy to derive models for frequency domain based anal¬ 
ysis for complex power converter topologies. 

2.3 Circuit analysis approach 

Simulation techniques are based on any one or two of the following circuit analysis ap¬ 
proaches. 


2.3a Circuit modelling techniques: (i) State variable approach (SVA); (ii) nodal analy¬ 
sis (NA); (iii) modified nodal analysis (MNA); (iv) state-space averaging technique (SAT). 

The SVA has been widely used in the power electronics area. This approach formulates 
a set of circuit equations in terms of branch voltages and currents and they are called 
state equations. These state equations can be further solved by any numerical method. 
SVA approach has been used by dedicated power electronic simulators such as ATOSEC5 
(Rajagopalan & Sankara Rao 1975), SCRIPT (0ms et al 1989), COSMIR (Hsiao et al 
1989), CAP (Liu efa/ 1993). 

The traditional nodal analysis based approach is used in EMTP in the area of power 
systems. Nodal analysis based power electronics simulator is also reported as PECAN 
(Lavers et al 1990). However nodal analysis-based simulators are not so efficient as they 
are incapable of handling current depending elements, performing fast steady state analysis, 
and it is difficult to obtain the branch currents in many power electronic circuits. These 
difficulties are overcome to some extent in Modified Nodal Analysis (MNA) as it includes 
circuit components such as voltage sources and other current depending elements. This 
approach has been used in many simulators such as PSPICE, SABER, KREAN etc. MNA 
has been implemented for switch modelling also (Sudha et al 1993). 

State-space Averaging Technique (SAT) is mainly based on state averaging concept. This 
technique converts a nonlinear, time varying, switched circuit into a equivalent circuit by 
averaging state variables. It is widely used for the analysis of switched mode converters 
(Middlebrook 1989). Direct implementation of SAT is usually done in SPICE simulator. 
Recently, computer-aided modelling technique of dc-dc converters, using SAT, is also 
proposed (Chapman & Cichards 1992). 

2.3b Symbolic math analysis (SMA): In Symbolic Math Analysis, the most common 
modes of operation of a converter topology are identified together with the corresponding 
equations for the mode changes. The analysis consists in solving the system equations 
from an initial operating mode with arbitrarily chosen or known initial conditions through 
a pre-determmed sequence of the different operating modes for a whole cvcle of ooeration. 



The analytical solutions for each mode of operation is derived for each mode in terms of 
the unknown initial conditions. Each mode of the system is governed by a set of total dif¬ 
ferential equations. Generally, the conditions for mode changes are governed, by nonlinear 
equations. Therefore, the solution for a complete cycle requires the computation of all the 
initial conditions through an iterative procedure. With the availability of general purpose 
software such as MATLAB and its toolbox called Symbolic Math Analysis or others such 
as Mathematica, Mapel etc., it is now possible to use the sequential method of analysis for 
the generation of design curves for new circuit topologies (Rajagopalan 1995). This ap¬ 
proach has been used to develop the switch power converter simulator named SPANTRAN 
(Liberatore et a/1991). 

There are a few other analytical methods which are used for power electronic converter 
simulation. Sample-data modelling techniques have been specifically used for the study 
of dc-dc converters which permit derivation of suitable transfer functions of the system 
thereby permitting traditional control system design methods to be used for the tuning of 
control parameters (Verghese et al 1986). 

2.4 Numerical methods 

Various numerical integration methods have been used in power electronic simulators. 
Traditional methods such as Newton-Raphson, Runge-Kutta, Backward Differential For¬ 
mula (BDF), Backward Euler, Trapezoidal, Implicit higher order BDF named Gear etc., 
are implemented in simulators (Mohan et al 1994). 


3. Simulators 

Many simulators are now available for the simulation of power converters. In the following 
section, some of the simulators with which authors are familiar are described. 

3.1 General purpose simulators 

3.1a MATLAB-SIMULINK: MATrix LABoratory (MATLAB) is a high performance 
iterative package for scientific engineering numerical computation (Matlab 1992). MAT¬ 
LAB integrates numerical analysis, matrix computation, signal processing and graphics 
in an user-friendly environment. SIMUlation LINK (SIMULINK) within the MATLAB 
works in two phases, (1) model definition (2) model analysis (Simulink 1992). It constructs 
simulation by building blocks or modules. SIMULINK has graphical interfaces for build¬ 
ing blocks. It uses the simplified switch models for power electronic converters. It uses 
the state equation models to solve any continuous/discontinuous, linear/nonlinear system. 
This simulator has a great facility to exchange the data between MATLAB and SIMULINK. 
It generates the graphical output using built-in functions. Simulink simulation slows down 
for the large power electronic converter systems because of its constant passing of data to 
and fro from MATLAB. The user-friendly graphical capability of MATLAB and various 
controller design functions are of great advantage in this simulation environment. 
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3.1b SIMNON: This simulator provides the time domain responses. It requires a sepa¬ 
rate tool for dealing with frequency domain analysis. It constructs simulation by building 
blocks or modules. It uses the state equation models to solve any continuous/discontinuous, 
linear/nonlineai* system. It generates the graphical output using built in functions. The sim¬ 
plified switch models are used in power electronic converter’s simulation. SEMAS is an 
example of a software that has been built in SIMNON environment for the time domain 
analysis of power electronic systems (Ba-Razzouk^ra/1993). New SIMNON for windows 
further provides user-friendly interfaces (Simnon et al 1993). 

3.1c EMTP: Electro Magnetic Transients in Power system (EMTP) is widely used 
in large scale power systems (Meyer & Liu et al 1974) . This simulator has models of 
transformer, transmission lines, machines and switch models such as diodes, thyristors 
etc. The nodal approach is used for solving the circuit equations. Here, trapezoidal or 
factorization methods are used for solving these equations and the time step for integration 
is specified by the user. 

3.Id EMTDC: EMTDC is basically a circuit analysis program (EMTDC 1986). It can 
be used to model any dynamic systems that can be constructed from building blocks 
consisting of integrators, limits, logic functions, delays etc. known as Continuous System 
Modelling Functions (CSMF). Furthermore, electric circuits and CSMF can be interfaced 
to create a model of both the circuit and controls. EMTDC can be used as a time domain 
simulation program for electric power and control systems. It also supplies methods for 
frequency scanning. Usually it is used for studying dynamics of power systems with various 
converters such as HVDC, STATCOM etc. 

3.1e SPICE: Simulation Program with Integrated Circuit Emphasis (SPICE) is used 
widely in electronic circuit analysis (Nagel 1975). It offers the various models, subcir¬ 
cuit libraries, graphical pre-processors and post-processors etc. Both time-domain and 
frequency-domain analysis are possible in SPICE. The circuit equations are developed by 
nodal approach and Gaussian elimination technique is used for solving these equations. 
Voltage dependent switches are modelled by nonlinear resistors. Nonlinearities can be 
resolved within the program. However the choice of Trapezoidal or Gear methods is also 
possible. Here, the time steps are automatically adjusted within the programs. SPICE is 
used in power electronics for simulation at low power levels e.g. Switch Mode Supplies. 
SPICE is available in Personal Computer environment, known as PSPICE. It is very pop¬ 
ular for circuit analysis because of its availability at very reasonable cost for classroom 
environments. 


3.If SABER: This simulator is well suited for nonlinear analog and digital circuits, 
especially for mixed mode circuits in power electronics (Xu 1990). It offers device mod¬ 
els, machine models, models for mechanical parts such as gears, dampers, fans etc. This 
simulator allows the developing of new models in dedicated description language MAST 
(Franz et al 1990). Designers can use other languages such as , FORTRAN, C, Pascal etc. 
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for developing the new models. This additional facility of external modelling makes the 
SABER more useful in power electronics. 

3.2 Dedicated power electronic simulators 

3.2a ATOSEC5: Analyse Topolologique des Systemes Electroniques en Commutation 
(ATOSEC5, French acronym) is power electronics dedicated simulator (Rajagopalan & 
Sankara Rao 1975). The power electronics system is described in the form of a set of state 
equations. These state equations are solved by Backward Euler method. In order to ensure 
the numerical stability the Newton Raphson iterative method is used for the solution. A 
constant time step is chosen for the solution of state equations. Several subroutines are 
used for FIRING, BLOCK, EXTFIR for well defining the power electronic systems. This 
offers a library of semiconductor switches such as diode, transistor, FET, GTO, MOSFET 
and special purpose switches such as PWM, hysteresis etc. User can define special kind of 
switch which is not covered in ATOSEC through EXTFIR and it can be used in principle 
subroutine for the solution. ATSECG interface module can be used to prepare the schematic 
diagram of the circuit. The circuit element values can be shown in a specific manner 
and can be used in a data file which can be obtained from ATOSECG interface module. 
ATOPLOT interface module shows the simulated curves which can be visualized during 
simulation. Any power electronic system, e.g. circuit with 100 branches, 60 nodes and 
any 40 semiconductor switches, can be easily handled by ATOSEC. All the programs are 
written in FORTRAN 77. 


3.2b PSIM: Power electronic circuit SIMulator (PSIM) is a dedicated power electronic 
simulator (Jin 1995, 1996). It uses the nodal and trapezoidal approaches for circuit equa¬ 
tions. It contains the library of several switches such as diodes, thyristors, transistors and 
bi-directional switches. It also contains the commonly used circuits such as three phase 
transformers, thyristor converters, PWM voltage/current source inverters. Apart from us¬ 
ing the actual control circuit, transfer function blocks and other function blocks can also 
be used in PSIM. All programs are written in FORTRAN. The first step is to create or edit 
the power electronic circuit in OrCAD or SIMCAD and then the circuit is simulated in 
PSIM and results can be viewed in SIMVIEW. Both SIMCAD and SIMVIEW are written 
in Microsoft Visual 


3.2c SIMUPELS: SIMUlation of Power Electronic Systems (SIMUPELS) is a user- 
friendly simulator built in popular MATLAB-SIMULINK environment (Ba-Razzouk et 
al 1995). Power semiconductor devices are modelled using SIMULINK. It also explains 
how to use the generated device models to construct any power electronic converter. Nodal 
approach and mesh type formulation of circuit equations are used. Extensive libraries of 
converters, control circuits, and electrical machine models are included in the simulator. 
The simulator contains several examples of simulation of many power electronic systems; 
in particular a number of case studies dealing with ac and dc drives are available. All the 
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3.2d KREAN: This simulator is dedicated to power electronics especially for power 
converters with high switching operation (KREAN 1990; Nilssen & Mo 1990). All switches 
are modelled as ideal switches. Nonlinear components are described in FORTRAN sub¬ 
routine and linked to the main program. Nodal approach is used for developing circuit 
equations. Runga-Kutta, Newton-Raphson, and sparse matrix methods are used for solv¬ 
ing these equations. 

3.3 Power electronic tutor (CATPELS) 

Computer-Aided Tutor in Power Electronic Systems (CATPELS) is a useful multimedia 
tool which helps the user to understand a simulation and a subject from basic to advanced 
level. This involves sophisticated computer-aided analysis and design tools. This tutor 
outlines the major requirements and concepts of a computer-aided learning aid for power 
electronic systems and gives the characteristics of a prototype of an expert system named 
CATPELS (Rajagopalan et al 1995). The open architecture of the run-time version of 
the CATPELS software provides not only the basic information for beginners and all the 
required tools for specialists but also permits a user (an instructor or a researcher) to expand 
its capabilities and to customize as per specific requirements. 

4. Demonstrative examples 

A commonly used example of three phase self commutated converter, shown in figure 
2a, has been analysed and simulated. Figure 2b shows the simulated model in MATLAB- 
SIMULEnIK within SIMUPELS (Ba-Razzouk et al 1995). Here, the power converter mod¬ 
els are developed on the basis of switching function theory. The voltages at the converter 
input are synthesized from the dc output as given in (l)-( 3 ) below. 

( 1 ) 

( 2 ) 

( 3 ) 

The input current depends on the output currents, 

lout = ij.Sl J 2 -S 3 -f- 13 .S5 C^) 

The appropriate space-vector PWM algorithm for the converter is implemented with 
the MATLAB files. The other auxiliary components referring to the generation of the 
space vector polar coordinates, the command circuit avoiding the short-circuit and the 
clock generator have been grouped in the desired parameters of the PWM generator (pulse 
frequency, number of pulses and modulation index etc.) in a user-friendly manner. To¬ 
gether with the modules for power converters and PWM techniques some special modules 
are designed for ac filters and loads allowing an easy adjustment of desired component 
values. 


Vout 

= —(2.S1-S3-S5), 
Vout 

e 2 = —(2.S3-S1-S5), 
Vout 

e 3 = -^(2.S5-Sl-S3). 




(b) 

Figure 2 . (a) Three phase voltage source converter, (b) Simulated in MATLAB-SIMULINK. 

Three more demonstrative examples are provided in figures 3-5. Figure 3 shows the 
single-phase diode boost rectifier with single-switching device on dc side in PSIM. Figures 
4 and 5 show the three-phase boost rectifier with single-switching device on dc side and 
simulated in MATLAB-SIMULINK and SABER respectively (Dawande & Dubey 1996). 
Figure 6 shows a typical case study in CATPELS with various menu options available to 
the user for ac-dc converters. 


5. Difficulties and challenges 

When the simulation of power electronic converters is carried out, a number of problems 
may occur due to imperfect models used for devices resulting in lower accuracy. To cite 
an example, when using macro-models for semiconductor power switches, it is not possi- 








Figure 4. Three-phase boost rectifier in MATLAB-SIMULINK, 
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Figure 6. ac-dc converter study in CATPELS. 
































ble to calculate accurately the power losses in the devices during tum-on and turn-off of 
the devices. In order to calculate accurately these transitions, sophisticated physics-based 
semiconductor models (micro-models) are required for the switches. Use of such models 
for the study of power electronic converters increases the complexity, reduces the com¬ 
putational efficiency and even causes numerical unstability. Modelling of bipolar devices 
such as IGBT, GTO etc. are devoid of carrier diffusion property. Most of the switch models 
use a single variable equation and do not use the time-dependent partial differential equa¬ 
tion for the proper characteristic of the device. Simulation in power electronics becomes 
difficult when there are sharp discontinuities in the circuits. The study of converter-fed 
machine requires extensive time for data preparation and solution. 

The availability of various simulators dedicated to specific domains have some advan¬ 
tages and disadvantages when chosen for a given application. A universal simulator with 
a facility to link all the simulators in a user-friendly environment will be welcome. This 
can provide the study of a particular condition of a power electronic converter system by 
using a simulator that is most suitable from a library or bank of standard simulators. For 
the sake of discussion, consider the analysis of a power electronic converter system. The 
expected standard bank contains several simulators Ai, X 2 , X 3 ,... Suppose Xi deals 
with the modelling of devices, which can model the converter with thyristors, MOSEFTs, 
IGBTs etc., e.g. NIST simulator; X 2 deals with steady/transient state response in time and 
frequency domain e.g., SIMUPELS; X 3 deals with high voltage application software like 
EMTP/EMTDC etc. Then the study of any converter system at all levels of application 
could be easily possible, without any limitations of a certain simulator, in a user-friendly 
environment. It will be beneficial for all university researchers and industry designers to 
have such a universal tool readily available for the analysis of any power electronic con¬ 
verter system. However, this approach is feasible only if such simulators are available at 
reasonable cost with time-sharing possibility on the Internet. 


6. Future trends and developments 

In the next few years, computer-aided design of power electronic converters will be more 
needed to reduce the design and manufacturing cost. The system which can be designed 
to control, implement and test the high speed control algorithms of power electronic con¬ 
verters and which can bring the hai’dware in the loop simulations in real experiments will 
be essential for future applications. 

Presently, the link of simulation and hardware has been brought out by using dSPACE 
system in our research group. This system consists of Digital Signal Processor (DSP 
TMS320C40), input/output boards, graphical tool TRACE etc. The following systematic 
process is adopted. 

A simulated system of harmonic compensation of three-phase converter is developed in 
MATLAB-SMULINK. The necessary controller is designed in SIMULINK and grouped 
in the block of ‘compensator discrete’ (Yao 1997). After achieving the satisfactory simu¬ 
lated results, the controller is modelled in dSPACE environment with necessary input/output 
channels which are shown in figure 7a. This has to be built from the selected hardwaire 
library of dSPACE. Channels 1-3 sense the three input currents of a three-phase converter. 



Simulation of power electronics converter systems 


701 


while channels 3-6 sense the input voltages of a three-phase converter, through ADC board 
(DS2003). The processed output signals of the controller are delivered to hardware through 
DAC board (DS2103). The dSPACE Real-Time Interface (RTI) contains all the interface 
I" ' software necessary to integrate the combined development of the MATLAB, SIMULINK 
and Real-Time Workshop. Code generation from SIMULINK block diagrams is fully auto¬ 
matic. Figure 7b shows a real-time interface which integrates the simulation and hardware. 
Figure 7c shows a controller model which has been downloaded to DSR All the necessary 
input/output signals along with the controller signals are seen through graphical tool called 
TRACE in dSPACE. The option menu of TRACE for various signals is seen in figure 7c. 
The implementation of the simulated controller, directly to hai'dware using dSPACE is 
successfully done. The actual experimental monitored signals are seen in figure 7d. 
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Figure 7. (a) Harmonic compensator model developed in dSPACE. 
(b) Real Time Interface integrating simulation and hardware, (c) Graph¬ 
ical TRACE tool showing simulated and experimental signals, (d) Mon¬ 
itored experimental results. 


7. Conclusions 

This paper gives a comprehensive state-of-the-art review of power electronic simulation 
techniques. It reviews various methods of modeling, circuit analysis approaches and numer¬ 
ical methods used in different simulation techniques. Several general purpose simulators 
such as MATLAB-SIMULINK, SMNON, EMTP, EMTDC, SPICE, SABER etc. and ded¬ 
icated power electronic circuit simulators like ATOSEC5, PSIM, SIMUPELS, KREAN are 
discussed in brief. Several demonstrative examples using different simulators are provided 
to show the importance and ease of simulation in power electronic converters. Consider¬ 
ing the use of hardware in simulation which may be the essential future demand in power 
electronic converter systems, some new developments are introduced to show the wide 
scope of research and developments. 
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Recent advances in var compensators 
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Abstract. Static var compensators have been, for many years, an essential 
component in the operation of power transmission systems. They are part of 
a family of devices known as Flexible AC Transmission System (FACTS) de¬ 
vices. The advent of large capacity force-commutated semiconductor switches 
allows many developments in power electronic converters to be applied to the 
implementation of high power compensators. This paper describes the princi¬ 
ples of controlled reactive power compensation, particularly in the context of 
power systems. It focuses on active static power converter-based compensators 
and discusses issues related to the power circuit topology and control tech¬ 
niques, including the impact of Pulse Width Modulation (PWM) techniques. 
Compensators based on current and voltage source converters and on ac con¬ 
trollers, both in the shunt and series configurations, ai'e covered. Methods to 
enhance power capacity using multi-level and multi-pulse arrangements are 
discussed. 

Keywords. Reactive compensation; power electronic converters; power 
systems. 


1. Introduction 

Reactive power (var) compensation has long been recognized as an essential function 
in the operation of power systems. At the distribution level, it is used to improve the 
power factor and support the voltage of large industrial loads, such as line commutated- 
thyristor drives and electric arc furnaces. Reactive power compensation also plays a crucial 
role at the transmission level in supporting the line voltage and stabilizing the system. 
Rotating synchronous condensers and mechanically-switched capacitor and inductor banks 
have been replaced in the 1970s by thyristor-based technologies: in typical installations, 
a thyristor-controlled reactor (TCR) provides variable lagging vars, and fixed or thyristor- 
switched capacitors (TSC) provide the leading vars. The combination of both devices in 
parallel allows continuous control of vars over a wide range of values, from leading to 
lagging vars (Gyugyi 1979). A large number of units have been successfully installed and 
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static power converters have also been recognized and a number of configurations proposed 
and investigated (Gyugyi 1979). 

However, thyristor technology only allows the implementation of lagging var generators, 
unless complex force-commutation circuits are used. This drawback has been eliminated 
with the introduction of Gate Turn-Off (GTO) thyristors (Larsen et al 1992). This has 
allowed the development of a number of configurations based on the use of synchronous 
voltage sources (Gyugyi 1993). Prototype GTO-based var compensation units, known 
as STATCOMs have been installed and tested by utilities (Schauder et al 1995). The 
STATCOM and other static var compensators have recently been grouped, together with 
other types of transmission system control devices, under the heading of Flexible AC 
Transmission System (FACTS) devices. 

Reactive power compensators are typically connected in shunt across transmission and 
distribution systems. An alternative connection, the series connection, has recently received 
much attention from utilities (Gyugyi et al 1996). Technological solutions have been 
developed to solve problems associated with insulating the equipment from ground and 
the full potential of series connections can now be exploited. The latest development in var 
compensation technology has been the combination of series and shunt static compensators 
into one unit, known in the area of power systems under the name of Unified Power Flow 
Controller, or UPFC (Gyugyi 1992). 

Static power converters have been successfully applied to a large number of power 
conversion problems at low and medium power levels. However adapting these solutions 
to high power transmission and distribution levels raises special issues. Although the 
capacity of power semiconductor switching devices has gradually increased, large ratings 
still require combining devices in series and parallel. In addition to the large power handling 
capacity, static compensators must have very high efficiency, since losses have a negative 
impact on both the initial and operating costs of the power system. Switching losses are 
therefore a primary concern and switching frequencies must therefore be kept low. This may 
result in large harmonic waveform distortion, unless special power circuit configurations 
are used. 

This paper reviews the various methods available for generating reactive power (var) 
by means of force-commutated static power converters, taking into account the above 
constraints. It discusses topologies suitable for use with devices such as GTOs and the 
more recently available high power IGBTs and addresses switch gating issues, including 
the use of Pulse Width Modulation (PWM) techniques. Methods for designing high power 
converters suitable for transmission level compensation are presented, particularly multi¬ 
level and multi-module topologies. 


2. Principles of var compensation 

Var compensation can be viewed as the injection of reactive power, leading or lagging, 
into the ac system. In its simplest form, reactive power injection is achieved by inserting 
fixed capacitors or inductors in either series or shunt into the ac system. Assuming a 
compensation reactance Xc is inserted in a transmission system, the generated var Qc is 
derived as follows. 
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(a) y^/S; 




Figure 1. Principles of var compensation in transmission systems, (a) Shunt compensation, 
(b) Series compensation. 


(a) For the shunt connection, figure la, a reactive current Icq is generated, allowing in 
particular line voltage support at the point of connection, Vc'^ 

Jcq — ^cj^ci 

Qc = vllX^. 


(b) For the series connection, figure lb the reactive impedance Xc partially compensates 
the line reactance, and a reactive voltage Vcq is inserted in series, the current Ic being 
the line current: 


Vcq=IcXc 

Q, = l}x, 


In addition to the lack of controllability of the reactive power injection, fixed capacitive 
compensation can lead to ac system instability, such as in the phenomenon known as 
subsynchronous resonance, or SSR, associated with series compensation (Ooi & Dai 1993; 
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impedance must be varied. Equivalently, a variable current or variable voltage is injected 
into the system, emulating a variable reactance, figure lb. 

The apparent reactive impedance of a fixed element can be varied using ac switches, 
or ac controllers. On the other hand, the current or voltage required to emulate a variable 
reactance can be injected into the ac system by means of synthetic sources, which can be 
realized using static power converters (Agaki et al 1984; Campos et al 1994). In addition 
to providing reactive power, these active compensators can also supply real power, either 
transiently or for a number of periods of the ac supply. This real power can be used 
to dampen power system oscillations or temporarily support the power system voltage 
under fault conditions. Furthermore, since the compensator is fully controllable, resonant 
frequencies associated with the use of capacitors are eliminated and the potential for 
instability suppressed. 




(c) 

Figure 2. Thyristor-controlled variable reactance compensators, (a) Thyristor-controlled 
reactor (TCR) and thyristor-switched capacitor (TSC) shunt compensator, (b) Transfer char¬ 
acteristics of a TCR with fixed capacitor, (c) Thyristor-controlled series capacitor compen¬ 
sator (TCSC). 



3. AC controller-based structures 


3.1 Conventional thyristor-controlled reactor 

The basic scheme, figure 2a, consists of an ac controller which varies the apparent induc¬ 
tance of the inductor as reflected onto the ac line (Gyugyi 1979). The thyristor-controlled 
reactor (TCR) provides continuously controllable lagging vars and is biased using fixed, 
or more often, thyristor-switched capacitors (TSC). The injected vars can therefore be 
continuously adjusted from leading to lagging, figure 2b. However, the var injection or 
voltage regulation capability of the compensator is limited by the value of the reactance 
and is therefore line voltage dependent, figure 2b. Advantages of the system however in¬ 
clude ruggedness, high efficiency, good dynamic performance and competitive cost. The 
disadvantages include injection into the line of large low frequency harmonic currents, 
dominant being the 5 th and 7th components (300 and 420 Hz for a 60 Hz system) for the 
basic ac controller. Harmonics can be moved to higher frequencies by paralleling units 
and using special transformer configurations. Harmonic currents can be reduced by means 
of tuned LC filters. These however are costly and can cause voltage oscillations resulting 
from the added system resonant frequencies. 


3.2 Force-commutated ac controller structures 

An alternative to the thyristor-based ac controllers is the force-commutated ac controller, 
figure 3a (Jin et al 1994; Venkataramanan & Johnson 1997). The use of force-commutated 
switching devices removes the requirement for operating the converter in synchronism with 
the ac supply and allows gating the switches more than once per cycle. Arbitrary gating 
patterns can be implemented, particularly PWM patterns (Jin et al 1994). The principles of 



pacitors are not shown), (a) Basic ac controller structure, (b) Multi-module converter. 




/ lU 


uezu juos 


PWM pattern generation are explained in § 6.2. Assuming the inductor current is sinusoidal, 
a pattern with constant duty cycle yields ac line side currents that only contain harmonics 
around the switching frequency and its multiples. This pattern is simple to implement 
and allows control of the equivalent inductance; therefore, the amount of vars absorbed 
can be varied from 0 to a maximum value. Since losses associated with switching large 
currents at high voltages increase significantly with switching frequency, this frequency 
must be kept low, typically a few hundred hertz for GTOs. In order to reduce the distortion 
of injected currents, while keeping the switching frequency low, elementary modules are 
connected in parallel, figure 3b, and gated so that harmonics are minimized (see discussion 
in § 7). This also allows the realization of var compensators of large ratings (Lopes et al 
1996). 

3.3 Thyristor series controlled capacitor 

The dual of the TCR is the thyristor-controlled series capacitor (TCSC), figure 2c. It uses 
a thyristor-based ac controller to continuously adjust the apparent reactance seen from 
the line. Such units have been successfully installed in transmission systems. However, 
they have the same limitations as TCRs, including harmonic injection. Performance and 
harmonic reduction may be enhanced by using force-commutated switches. 

Although ac controller-based compensators are rugged and simple to control, operat¬ 
ing regions are determined by the reactances used and by the line conditions, figure 2b. 
Converter-based structures are more versatile and flexible and are therefore receiving more 
attention. 


4. Active compensator topologies 

The availabihty of controllable voltage and current sources, realized using static power 
converters, allows the implementation of static compensators which can emulate the char¬ 
acteristics of a variable reactance, (figure 4). Advantages over the variable reactance struc¬ 
tures include the following. 

(a) The performance characteristics are improved, particularly reactive power generation 
Qref capability, and are independent of the fine parameters and line steady state 
operating voltages and currents, figure 5b. 

(b) Energy storage devices, such as battery banks (Walker 1990) and superconducting 
magnet energy storage systems (SMES) (Zhang & Ooi 1993) can be incorporated into 
the units. This allows the absorption and generation of real power, Pref, which can be 
used for power system damping and voltage support under fault conditions. 

In the shunt connection, figure 4a, the power converter system behaves in a manner 
similar to a synchronous machine operated as a synchronous condenser. If the converter is 
considered a voltage source behind an inductance, varying the output voltage is equivalent 
to varying the internal voltage of the synchronous generator, figure 5 a. The shunt connection 
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(b) 



(c) 

Figure 4. Principles of active compensation based on static power converters, (a) Shunt 
connection, (b) Series connection, (c) Series-shunt connection (UPFC). 

3 long been the preferred connection: one end of the compensator can be connected to 
)und and fault currents do not flow through the compensator. 

The series connection, figure 4b, offers a number of advantages over passive series 
tnpensation at a cost that is becoming competitive in demanding applications (Gyugyi 
al 1996). These include: (i) the elimination of the risk of resonance normally asso- 
Lted with fixed series compensation (SSR); (ii) the possibility of injecting real power 
0 the system and modifying the apparent resistance of the line. This improves the 
wer transmission system characteristics, including the possibility of increasing the X/R 
io. 

The combination of series and shunt connected converters, connected to a common 
bus, figure 4c provides the advantages of both the series and the shunt configuration 
d adds another degree of control. This structure is known as the Unified Power Flow 
ratroller or UPFC (Gyugyi 1992). In addition to its operation as a var compensator, 
s scheme can also be used as a phase shifter. Although the dc bus can be either of the 
Itage or current source type, most of the structures that are investigated are based on the 
Itage source configuration. The voltage source dc bus, figure 4c, is the standard config- 
ition for the UPFC structure. Inverter structures and their characteristics are discussed 
§5. 
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5. Converter structures 


5.1 


Current source type structures 
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(a) (b) 

Figure 7. Static power converter output waveforms. AC line voltage for voltage source or 
ac line current for current source, (a) Waveforms for single pulse patterns and pulse width 
modulation (SHE) patterns (5th and 7th harmonic elimination), (b) Harmonic spectrum for 
single pulse patterns and pulse width modulation (SHE) patterns. 


[he lagging reactive power limitation is removed when thyristors are either force- 
amutated (Walker 1986), or replaced by force-commutated devices, such as GTOs, 
ire 6a (Edwards et al 1988). The use of such devices also allows the implementation of 
TVI patterns to (a) control the level of injected current, (b) limit the amount of harmonic 
rents generated (Moran et al 1989a; Espinoza & Joos 1994). However, the structure 
uires an input capacitor and a line filter reactor, with the associated resonant frequency, 
thermore, in the standard control scheme, the current in the dc inductor is maintained 
istant at the rated value, resulting in significant losses, even when the compensator 
plies no reactive power. This has led to the development of compensators based on 
erconducting magnet energy storage devices, SMES (Zhang & Ooi 1993). This tech- 
ogy however is complex and costly and the preferred compensator configuration is 
ed on voltage source structures. 


Voltage source type structures 

Jse structures use a voltage source, a capacitor or a battery bank, on the dc side of the 
dc inverter and reflect an ac voltage on the line side, figure 6b. Therefore, an ac inductor 
St be used to couple the converter to the line. Part or, in some implementations, all of 
inductance is provided by the transformer leakage reactance. As such, the structure 
dmilar to that of a synchronous condenser, and the converter may be referred to as a 
Lchronous voltage source. 

\s in a synchronous condenser, the current injected into the ac supply is controlled by 
magnitude of the reflected voltage, figure 5a: a voltage larger than the ac supply results 
njected vars (leading or capacitive); the converse, or lagging reactive power, results if 
voltage is smaller. Methods for controlling this voltage are explained in § 6.3. Since the 
bus voltage is controlled and independent of the line voltage, the steady state operation 


of the compensator, in particular the compensating capability, illustrated in figure 5b, is 
independent of the line conditions (compare with figure 2b for a TCR-TSC conventional 
compensator). The compensator therefore provides a better dynamic response and the 
voltage support in the case of operation of the power system under fault conditions is 
improved. 

The converter operates as a reactive power source, and in steady state does not supply or 
absorb real power. Losses however must be supplied either: (i) from the ac supply for a self- 
controlled dc bus, the preferred solution, in which case an additional dc capacitor voltage 
control loop must be used and a small amount of real power flows into the converter to 
cover losses (Moran etal 1989b); (ii) from a separate dc source, usually an ac/dc rectifier. 
If the var compensator is required to supply real power under transient conditions, battery 
banks can be connected on the dc side (Walker 1990). 

Advantages of the scheme include low harmonic current injection, if harmonic voltage 
components are minimized (see §§ 6.2 and 7), and high dynamic performance. Losses are 
lower than for current source stmctures. Weaknesses include the risk of dc short circuits, 
when both switches in a leg conduct, and the requirements for output short circuit protec¬ 
tion. Current source stmctures are more mgged: the short circuit current is limited, the dc 
bus being a current source. 


6. Converter control techniques 

The basic requirement of the compensator is to generate vars, by controlling either the 
voltage or the current generated by the converter. Since most of the compensators are 
based on the voltage source topology, the discussion on control techniques will focus on 
this topology. However, the conclusions regarding gating patterns are equally applicable 
to the current source topology: by virtue of duality, current source inverter (CSI) current 
patterns are identical to the line to line voltage patterns of the voltage source inverter 
(VSI). However, the gating of a VSI is generally simpler than that of a CSI: this is the 
result of the bidirectional current capability of the switches in the VSI, which ensures that 
there always is a return path for the dc bus current; therefore, there are no restrictions on 
the gating pattern. Switches in the CSI however are unidirectional devices and therefore 
require complementary gating to satisfy the requirements for dc bus current continuity. 

6.1 Single pulse gating 

Thyristors in rectifiers and inverters can only be gated once per cycle and line current must 
always be lagging the voltage to ensure line commutation. If thyristors are replaced by 
force-commutated switches such as GTOs, this restriction is lifted and both leading and 
lagging currents can be generated. Gating of each switch in a leg for half the ac supply 
period leads to a single pulse per period: this is the conventional means of controlling large 
static synchronous condensers, figure 7. An important advantage is that this method leads 
to the lowest switching losses and the highest efficiency. However, the pattern is fixed 
and the fundamental or desired component of the voltage (or current in a CSI) depends 
upon the dc bus voltage (or current). Furthermore, the harmonic content of the inverter 



Figure 8. Multi-module pattern generation by carrier phase shifting, (a) Principles of Sine 
Pulse Width Modulation (SPWM) and phase shifted carriers (8 carriers, with only 3 shown), 
(b) Output voltage of individual modules and harmonic spectrum (carrier to fundamental 
frequency = 9). (c) Total output voltage and harmonic spectrum (4 units). 
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Figure 9. Multi-pulse compensator - voltage source topology, (a) Structure of individual 
units and output voltage waveforms, (b) Harmonic cancellation by transformer phase shifting 
and inverter output voltage waveform (24 pulse, n = 4, dominant harmonics at the 23rd and 
25 th). 


output voltage is large resulting in large injected harmonic currents. Methods to redu 
the harmonic content include large filters, a solution that is uneconomical or multi-pul 
techniques, discussed in § 7. 

6.2 PWM techniques 

In order to vary the fundamental component of the output voltage for a fixed dc bus voltaj 
Pulse Width Modulation (PWM) is used. A number of techniques have been develop( 
among which; (i) selective harmonic elimination (SHE), (figure 7) (Kojori et al 199' 
(ii) carrier PWM, the most common being the sine PWM (SPWM), (figure 8a) (Kuang 
Ooi 1994; Suzuki et al 1997), and (iii) space vector modulation (SVM) (Bakhshai et 
1997). For high power applications, SHE has been successfully applied, since it ininimi 2 
switching frequency for a given harmonic content. Criteria for selecting an aoDroori; 
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(a) Instantaneous on-line control: SPWM and SVM allow instantaneous control of the 
inverter output voltage; patterns generated by a SHE algorithm are generally stored in an 
EPROM or equivalent, which leads to a slower dynamic response. 

(b) Harmonic minimization: Direct, selective and precise harmonic elimination can only 
be obtained with the SHE techniques; other patterns rely on the fact that the dominant 
harmonics are related to the switching frequency and its multiples. The switching frequency 
is equal to the carrier frequency for SPWM, as shown in figure 9, or to 2/3 of the cycle 
time for SVM. 

(c) Maximum inverter output in the controllable range: Patterns that are generated on 
a three phase basis (SVM and SHE) maximize the output voltage for a given dc bus 
voltage; in SPWM, patterns are usually produced on a per leg basis, resulting in redundant 
switching on a line to line basis and a reduction in the output voltage. However, this value 
can be increased by injecting a third harmonic component in the modulating reference 
voltage. 

6.3 Output control 

The fundamental component of the output voltage of the inverter, V), i (line to line peak), 
can be controlled, as required for var compensation purposes, figure 5c, either by changing 
the dc bus voltage V^c or the PWM pattern, through modulation index control, M: 

Vu = KacMVdc, 

where Kac is the gain of the converter, which depends upon the PWM pattern. For a three- 
phase inverter, the gain Kac varies between 0.86 for SPWM, and 1.03 for SHE, SPWM 
with third harmonic injection and SVM, 

The dynamic performance of the compensator depends largely on the control technique 
chosen. 

(a) DC bus voltage control: If no PWM or a fixed SHE pattern designed for harmonic 
elimination only are used, output voltage control is obtained by varying the dc bus capacitor 
voltage Vdc. This is achieved by charging or discharging the dc capacitor; this can result 
in a slow response which depends upon the dc capacitor value, the dc voltage and the ac 
inductor (Joos etal 1991)', 

(b) PWM pattern control: If the PWM pattern is varied to produce a variable voltage from 
the fixed dc bus voltage, response can be very fast, in the order of the switching period. 
Instantaneous control of the injected current can then be implemented, and response is 
much faster than with conventional TCRs. 


7. High power structures 

In order to increase the power rating of the compensator to power system levels, typically 
100 MVAR or more, available power switching devices must be combined in series and 
in parallel. However, advantage can be gained by combining converter units rather than 
individual devices: these units are associated so as to minimize harmonic injection while 
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maintaining switching losses low. This significantly increases compensator efficier 
which must be kept very high (ideally 98% or higher). 

Multi-stepped voltage waveforms that more closely approximate a sinusoidal volt; 
can be obtained in a number of ways, among which are multi-level and multi-pulse or mu 
module structures. In addition to providing flexibility in the control of the fundamental; 
harmonic components, these structures allow the increase of the voltage and current rat 
to levels required for transmission systems. They also provide the option of remov 
the coupling transformer used for voltage matching, which may reduce the cost of 
installation (Hochgraf & Lasseter 1996). A number of structures are presented bel 
based on voltage source inverters, and the implementation of harmonic cancellatioi 
discussed. 


7.1 Multi-pulse and multi-module structures 

Rather than increasing the frequency of the PWM pattern to reduce the harmonic conter 
the output voltage, a number of units can be connected in series through a magnetic inter! 
circuit, figure 9b. Voltage distortion is reduced by harmonic cancellation or minimizat: 
Some multi-pulse structures use PWM. If no PWM is used, output voltage control mus 
implemented by dc capacitor voltage control, as described in § 6.3 (Schauder et al IS 
Agaki et al 1990). 

In general, harmonic cancellation is achieved by phase shifting the harmonic compon* 
to be cancelled so that, when the output voltages of individual units are added, tl 
components cancel. This can be obtained by using the following. 

(a) Phase-shifted transformer windings: Converter units are identically gated, figure 
but the fundamental component of the output is shifted by 60°/n for n converters 
applied to transformers with a phase shift of 60°/az, figure 9b; this allows total harmc 
cancellation (Mori et al 1993). 

(b) Patterns with phase-shifted harmonics: Phase-shifted carrier techniques, as show 
figure 8a for 8 units, exhibit this feature; these techniques however only provide harmt 
minimization; for example, in figure 8b, individual converters have harmonic compom 
around the 9th and multiples, whereas the combined output of 4 units, figure 8c, c 
contains dominant components around the 36th and multiples (Kuang & Ooi 1994); tr; 
fonner configurations are similar to that of figure 9b, but no phase shift is required 
standard transformers can be used. 


7.2 Multi-level structures 


By splitting the dc bus capacitor, figure 10a, it is possible to generate a stepped vol 
similar to that in figure 8b. A number of topologies are available among which the Di 
Clamped Capacitor Multilevel Inverter (DCMLI), figure 10b and the Flying Capat 
Multilevel Inverter (FCMLI) (Hochgraf et al 1994). Typically 3 to 7 level converters 1 
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Figure 10. Multi-level compensator-voltage source topology, (a) General three-phase struc¬ 
ture. (b) Power circuit for one-leg of a 4-level diode clamped converter. 


7.3 Single phase structures 

The structures presented above are based on three-phase converter units. This is the nat¬ 
ural choice for three phase systems. However, implementations based on single-phase 
converters of the voltage source type have been proposed, figure 11 (Lai & Peng 1996; 
Peng & Lai 1996). Advantages include: (i) modularity and ease of expansion, obtained by 
stacking units, particularly to meet voltage ratings; (ii) control flexibility, since control is 
implemented on a per phase basis and unbalances are more easily handled. Output voltage 
waveforms are similar to that of figure 9b for a single pulse operation (Lai & Peng 1996). 
PWM can also be used. 


8. Summary 

Force-commutated static power compensators, part of the FACTS devices family and based 
on GTOs and on IGBTs, offer an interesting alternative to conventional shunt or series, fixed 
or variable, reactance compensation. They are capable of producing the required amount of 
reactive power independently of the line voltage or current: this is the result of their voltage 
or current source nature. They can both supply and absorb reactive power. Furthermore, 
as required, they can supply real power if equipped with energy storage devices. They 
have almost instantaneous response to control variable changes. Unlike systems based 
on passive components, they do not introduce potentially damaging resonant frequencies. 
Advanced static compensators are therefore one of the key elements in enhancing the 
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Figure 11. Multi-module compensator based on single phase units - voltage source topo¬ 
logy. (a) Three-phase structure, (b) Multi-module single phase converter, series connection 
- one leg. (c) Circuit topology of each unit. 


The author wishes to acknowledge the help of Mr A Bakhshai in providing the data ; 

assembling the figures. 
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Abstract. Power electronic loads inject harmonic currents into the utility 
causing overheating of power transformers and neutral wires, the power sys¬ 
tem, unpredictable performance of protection systems etc. In addition, electric 
resonances in such loads can also cause other undesirable phenomena like 
voltage fluctuations, radio frequency interference (RFI) etc. To mitigate these 
undesirable effects, a new class of power electronics equipment (Active Filters) 
is being considered. A review of present-day solutions in the area of active fil¬ 
ters is conducted in this paper. Finally, this paper discusses the trends in the 
design of active filters and the factors influencing them. 

Keywords. Active filters; harmonics; multi-level inverter; series-connection; 
hybrid filters. 


Introduction 

le past few decades, there has been an increase in the use of electric power throughout 
world. This increase has been accompanied by the use of equipment in our day to 
life which has shown an increasing amount of sophistication and an appetite for good 
ity electric power. The past few years has also witnessed the occurrence of a number 
lishaps like distribution transformers catching fire, blackouts etc. Such incidents are 
:ting an increasing number of consumers as the use of electric power becomes more 
^spread. On investigating these accidents, it was found that one of the causes of these 
irrences was electric power “pollution”. This is generated by the use of modem ap- 
tices and equipment connected to the grid. Typical examples at the residential level 
tide computers, colour televisions, fluorescent ballasts etc. At the industrial level, high 
er Adjustable Speed Drives (ASDs) and arc furnace loads are typical examples of 
er polluting loads. These loads are used extensively in steel rolling mills, chemical 
:ess industries etc. 

dmmon undesirable characteristics of all these loads are the following. 

They draw non-sinusoidal current from the utility, and 
distort the utility voltage waveforms. 
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Table 1. Details of harmonic currents drawn by the load of figure 1, 


Harmonic number 

Ih/I\ (in per unit) 

3 

0.77 

5 

0.42 

7 

0.15 

9 

0.08 


From the point of view of studying the effect that such loads have on the utility, it 
possible to represent any of the above mentioned examples in terms of the schematic 
agram shown in figure la which consists of a diode bridge rectifier and a resistor. 

In this diagram, the power electronics processor is represented as an equivalent resistor, 
he current drawn by the load from the utility is shown in figure lb. The voltage Vbn at the 
hnt of common coupling (PCC), shown in figure Ic, is distorted due to the nature of the 
irrent drawn by the load. Thus any other consumer who is connected across points b — n 
ould have to face this distorted voltage. The current drawn by the load has a dominant third 
irmonic component as shown in table 1 which contains a table containing details of the 
irrent harmonics. The usage of such loads can lead to overheating of power transformers 
id neutral wires, electric resonances in the power system, unpredictable performance of 
rotection systems etc. In addition, such loads can also cause other undesirable phenomena 
ke voltage fluctuations, flicker, sags, radio frequency interference (RFI) etc. (Redl et al 

m). 

In order to address this issue, certain institutions like the IEEE in the US and lEC (fol- 
iwed in Europe) have established guidelines which set limits on the amount of distortion 
irrent that the load can inject into the utility. However, there is a difference in philosophy 
i the framing of guidelines between the two institutions. 

The revision of IEEE-519 guidelines suggested by Duffey & Stratford (1988) became 
fficial in 1992. It takes into account the relative strength of the utility system and sets 
mits on the amount of harmonics (as a percentage of the fundamental current) that a user 
m inject at the Point of Common Coupling (PCC). In the case of the IEC-555 standard 
'hich was issued in the early 1980s to suggest guidelines for single-phase applications, 
le emphasis is on limiting the absolute amount of harmonics that is generated at the 
[juipment level. The other major difference is that lEC standards have been enforced into 
LW while IEEE 519-1992 guidelines are still recommendations. 


. Active filtering and current waveshaping techniques 

1 order to address the issue of harmonic current injection, there are two options. 

L) Incorporate current waveshaping circuits within the equipment so that they draw si¬ 
nusoidal line currents. 

Z) Install filters to “clean up” the distorted current waveforms drawn by these loads. 


726 


Ned Mohan and Girish R Kamath 



Figure 2. Single line diagram for active 
filter acting as a harmonic current source. 


not be the most cost-effective option, especially at medium to high power level (greal 
than 10 kW). 

In such cases it may be more economical to follow the second approach by installii 
an active filter unit at the plant level in order to solve the problem. 

(1) Current waveshaping techniques: lEC standards limit the current harmonics that a 
generated by single-phase electrical appliances. This encourages the approach whe 
one inserts a power electronics block between such a load and the utility to e 
sure that it (the utility) is insulated from the effects of the nonlinear load, i.e. t 
diode bridge rectifier of figure la is replaced by a “utility friendly” block whi 
draws sinusoidal current from the mains at unity power factor to maximize efficienc 
It then generates the required DC voltage at which the power electronic process 
operates. 

(2) Active filtering techniques: IEEE 519-1992 recommendations limit the injection 
current harmonics by a user at the Point of Common Coupling (PCC). For examp 
the IEEE 519-1992 recommends a limit of 5% in weak systems on the Total Harmor 
Distortion (THD) in the current at the Point of Common Coupling. The total harmor 
distortion (THD) in the current is drawn from the utility is the ratio of the rms val 
of its distortion component to the rms value of its fundamental-frequency compone: 
It is given as, 


%THD 



j/ hx 100 


where 

Ih = rms value of the current at harmonic order h, 

7i = rms value of the fundamental-frequency current component. 

In this case, the user is free to use any equipment in his premises which are not necessar 
utility “friendly” as long as he uses external harmonic current reduction techniques 
meet the guidehnes specified. Active filters fall in this category. They are harmonic curr( 
sources which are used in conjunction with the utility to provide the required harmoi 
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Figure 4. current waveshaping technique for three-phase applications. 
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input voltage. For those cases where input isolation is not required, a topology called 
Vienna Rectifier may be used (Kolar & Zach 1994). The method described by Kolar e\ 
(1997) can be used for universal input three-phase voltages or where there are large in 
voltage variations. 

2.2 Passive filters 

Filters are a method of “cleaning” up the current waveforms drawn from the utility in ca 
where the load does not contain a waveshaping circuit. Active and passive filters are ui 
either together (to form hybrid filters) or separately (to reduce current harmonics). 

Passive filters can be connected either in parallel with the load (shunt filter) or in sei 
with it (series filter). An ideal shunt filter serves as a short circuit for load harmo 
currents. A series filter consists of an impedance connected in series with the utility i 
acts as an open circuit to load harmonic currents. The disadvantages with passive fill 
are as follows. 

(1) They are bulky; 

(2) the filter impedance can resonate with the AC system impedance, resulting in increa 
voltage distortion and damage to equipment; 

(3) overcompensation of Vars can result in lowering the power factor. 

These drawbacks of passive filters can be overcome by active filters. 

2.3 Active filters 

Active filters can also be connected in series and shunt mode. An example of an aci 
filter for a three-phase, four-wire system (Quinn et al 1993) is shown in figure 5. 



Active power filters - Recent advances 



Figure 6. Equivalent circuit of a hybnd 
filter. 
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Utility 





Figure 7. Schematic diagram of a hybrid all-switching series converter for utility applications. 


mature technology, its cost is not expected to decline. The presence of a line-frequency 
transformer also results in the need for an additional DC demagnetizing current control 
loop to prevent the transformer from becoming saturated due to non-idealities in the 
main control loop and in the power circuit. 

(2) Multi-level inverters are being considered (Baker 1980; Hochgraf et al 1994; Matsui 
1995). However, for levels greater than three, the current flow in and out of the neutral 
point results in voltage variations of the neutral. This calls for additional neutral point 
voltage control (Fukuda & Sagawa 1995; Matsui 1995) and a limitation in the switching 
frequency. 

(3) The switches in the switch-mode converter may comprise of a series connection of 
discrete devices (e.g. GTOs). However the devices are derated and operate below their 
rated voltage rating (Takeda et al 1995; Ichikawa et al 1995). 



A solution to the above problems, as shown in figure 7, is proposed by Paice & Edwards 
(1987), Mohan & Kamath (1995) and Peng eta/ (1995). It consists of a number of converters 
whose outputs are connected in series, on a per-phase basis. The arrangement has the 
following properties: It consists of a set of slow switching converters (switching at line 
frequency or its low multiple) whose outputs are connected in series with a fast-switching 
PWM converter. The DC bus voltages in these converters are independent of each other 
and may or may not be equal. The devices are matched to their switching requirements, for 
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)lock the utility voltage, while that of the fast switching converter is to inject the 
priate current. Since the fast switching converter operates at low voltage and hence 
power, it produces much lower EMI (electro-magnetic interference) as compared to 
le conventional converter operating at high power. There is also a decrease in the 
f the interface inductor as compared to a single six-switch converter topology for the 
current ripple and switching frequency. 

lulation and experimental results of a proof of concept prototype with three converters 
cted in series and tested for an active filter application are presented by Mohan & 
th (1997). With improvements in device technology, both in terms of performance 
)st, and the use of digital control techniques to improve dynamic performance of the 
Q, there is a trend towards using all-solid state solutions to power quality problems, 
s consistent with the PEBB (power electronics building block) concept where the 
• switches are combined with their gate drives and associated control circuitry into 
:le module, for a variety of applications. There have been continuing advances in 
onductor technology resulting in improvements in device switching characteristics 
ler voltages, especially with the emergence of SiC (silicon carbide) devices (Weitzel 
996). This also indicates that the trend will be towards solid-state solutions for power 
3nic applications in general and active filters in particular. 

j desire for better quality power also means an increase in the complexity of the 
ons that will have to be performed by the active filters of the future. Akagi (1996) 
•oposed the concept of a general unified power quality conditioner which consists 
eries active filter and a shunt active filter. This power conditioner, in addition to 
)nic compensation, also takes care of voltage regulation, voltage flicker/imbalance 
msation, negative sequence current compensation and electric resonance damping. 


!onclusions 

has been an increase in the use of power electronic loads causing concern for power 
y. This paper describes some of the methods that are currently being considered to 
5s this issue. 
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Abstract. Operation and characteristics of resonant converters on the util¬ 
ity line are presented. Series-parallel (LCC-type) resonant converter operating 
with discontinuous current mode and continuous current mode (variable fre¬ 
quency control as well as fixed-frequency) are considered. Design examples 
are presented. SPICE simulation and experimental results obtained for the de¬ 
signed converters (rated at 150 W) are presented to verify the theory. It is shown 
that high line power factor (>0.95) and line current total harmonic distortion 
(THD) of <25% are obtained for the LCC-type converter for a wide load range 
(from full load to 10% rated load) without any active control, and the switch 
peak current decreases with the load current. With active line current control, 
low distortion and zero voltage switching for the entire cycle are realized. 

Keywords. Resonant converters; power factor; zero-voltage-s witching; zero- 
current-switching. 


Introduction 

ting the last few years, high-frequency (HF) soft-switching (including resonant) cen¬ 
ters have gained popularity due to their advantages compared to hard-switching PWM 
iverters. Some of the advantages are; zero-voltage-switching (ZVS) or zero-current- 
tching (ZeS) reducing switching losses, high efficiency, higher switching frequency, 
all size, low weight etc. The ZVS or ZCS reduces the switch stresses during the switch 
i-on or turn-off instants. Many of the configurations utilize the leakage inductance of 
HF transformer as part of the resonant inductor eliminating the voltage spike problems 
sent in PWM converters. 

Vith an increase in the number of power converters operating on the utility line, har- 
nics generated by these converters have become a matter of great concern. Due to the 
breement (either aheadv existing or being proposed) of strict harmonic regulations 
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(e.g. IEC555), there is an increased interest in the reduction or elimination of line current 
harmonics. The methods proposed or implemented in the literature to reduce the line 
current harmonics can be classified as; 

(1) use of line filters (active or passive); 

(2) use of power converters with active control of line current (Kataoka et al 1919\ Kocher 
& Steigerwald 1983; He & Mohan 1987); 

(3) modifications to the existing converters (Schlecht & Miwa 1987; Keraluwala et al 
1991; Schutten etal 1991). The last method is the simplest solution. 

Advantages of HF resonant converters can be utilized in ac-to-dc converters (Chambers 
1983; Nijhof 1986; He & Mohan 1987; Schlecht & Miwa 1987; Keraluwala et al 1991; 
Schutten etal 199 U Belaguli & Bhat 1995; Belaguli 1996) for realizing power conversion 
with improved performance (high efficiency, high power factor and low line current har¬ 
monics etc.) while reducing the size, weight and cost. Perhaps the earliest work in this area 
was reported by Chambers (1983). The series resonant converter (SRC) operating in dis¬ 
continuous current mode (DCM) was used without any line current control. Due to forced 
shut off of the SRC when the line voltage is around 50% of its peak value, the maximum 
power factor achievable and reduction in total harmonic distortion (THD) are limited. Ac¬ 
tive line current wave shaping has been used in a single ended resonant converter (He & 
Mohan 1987). 

2. Resonant converters and their operation on the utility line 

A number of resonant converter configurations are realizable using different resonant tank 
circuits (Steigerwald 1988; Bhat 1991) and the three most popular configurations are: series 
resonant (or series loaded) converter (SRC), parallel resonant (or parallel loaded) converter 
(PRC) and series-parallel (or LCC-type) resonant converter (SPRC). Load voltage regula¬ 
tion in such converters for input supply variations and load changes is achieved by either 
varying the switching frequency or using fixed-frequency (variable pulse-width) control. 
The SRC has high efficiency from full load to part load. But it requires wide variation in 
switching frequency for power control and the output filter capacitor must carry high ripple 
current when used in low output voltage and large load current application. The PRC is 
suitable for low output voltage with large load current applications and it requires narrow 
variation in switching frequency for power control. However, peak currents through the 
switches do not decrease with load current reducing the efficiency at reduced loads. The 
SPRC takes the desirable features of both SRC and the PRC. 

Keraluwala et al (1991), Schutten etal (1991) and Belaguli & Bhat (1995) have reported 
the operation of resonant converters with high power factor. But the line current harmonic 
distortion was very high (18% minimum around half-load to about 48% at full-load) and the 
switch peak current did not decrease with the load current since PRC was used (Schutten 
er <3/1991). Use of LCC-type converter had been discussed briefly. The major problem was 
the requirement of keeping the ratio of switching frequency to series resonance frequency 
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/entionally, HF link dc-to-dc resonant converters were operated with a smooth 
jt. This means, when the converter has to be operated on the utility line, a line 
r followed by a large filter capacitor (Cj) has to be used to get a dc supply. The 
mtages of this approach are well known: High peak currents drawn from the line with 
harmonic distortion (THD) greater than 80% and low line pf (<0.65). However, 
leal single-phase ac-to-t/c converter, the input line current has to be sinusoidal and 
;e with the line voltage. This results in power with a double line frequency {2ft, 
fl is the line frequency) having a peak value of IPq, where Pq is the average output 

le proposed ac-io-dc converters, a small HF filter capacitor Ci is used at the input 
;en et al 1991; Belaguli & Bhat 1995; Belaguli 1996). The dc link voltage in this 
approximately a rectified sine wave. This allows the HF inverter to draw current 
le utility line over the entire cycle reducing the peak cuments. However, a large 
ive output filter Cd is required to filter the 120 Hz voltage ripple at the output to 
a constant voltage Vq. Due to the rectified sine-wave at the input of the HF inverter, 
erter output voltage Vab is varying and the instantaneous normalized converter gain 
= V^/vsit) must also vary in order to draw sinusoidal line current and maintain 
;ant dc output voltage. The voltage gain M{t) is minimum at the peak of the < 3 c 
and maximum near the zero crossings of the ac voltage. 

’ closely the converter gain matches the required gain variation in order to get a line 
; which follows the input sine wave depends on the resonant converter configuration. 
:ase of SRC, the converter does not have enough gain for most part of the line cycle, 
suiting in poor pf and large line cuiTent distortion. It is well known that PRC has 
ain near valley points of line voltage at full load (for switching frequencies near 
nee) due to its high Q (for the PRC, Q = Ri/{Lr/ where Lr and Cp are the 
nt inductance and capacitance, respectively) near zero crossings. This is because, 
I varies along the ac cycle, with maximum loading at the peak of ac and light loading 
^ro crossings of ac cycle. In fact, the gain obtained is much higher than required 
g the line current to approach a square-wave resulting in a higher THD of line current 
load. In addition, the switch peak currents are high in the case of PRC and does not 
se with the load current. Only limited discussion on the use of LCC-type converter 
ven (Schutten et al 1991) due to insufficient gain available at the valley points of 
iltage, and the ratio of switching frequency to series resonance frequency was kept 
1.51) to maintain lagging pf (above resonance) mode of operation resulting in 
switch peak currents. But our research has shown that LCC-type converters can 
rated with high pf and low harmonic distortion while maintaining its advantages, 
bre, an LCC-type converter is taken as the example in this paper. 


peration of SPRC on the utility line 

emulation of a resistor 




enr)/^ T 
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Figure 1. High-frequency transformer isolated ac-to-dc converter employing series- 
parallel resonant converter. Note that Ci is an HF filter capacitor. 

is a rectified sine-wave. The output dc link filter inductor current is given by 
= Idm sin^Cw^LO 

where 

Also, series resonant Qs (t) expressed in terms of time varying reflected load resistai 
achieves its maximum at the peak of the line voltage cycle, and is given by 

QsiO — ^sL/= 12^ max sin^firi^r) 

where 

fismax = {i/Q)'/VR'Lp = (L/Cs)'^HlL/yo)< 

R'lp = Vp/(2Po), M = Vp! V„, L = Ls + Li, is the external resonant inductan 
Li is tibe leakage inductance of the HF transformer. 

Also, define; y^ = Ijtfs/cOsr, o^sr = l/(LCj)^/^, is the switching frequency. 

To get a sinusoidal line current, that is, for resistive emulation, the converter gain 
to be changed by exercising active control. However, it is shown in later sections that, 
matching the converter operating characteristics using the design constraints (§ 3.3), 
SPRC can be operated on the utility line, to obtain high pf with low harmonic cun 
distortion even without active control. 

3.2 Operating modes of a SPRC 

When variable frequency operation is used, depending on the switching frequency, SP 
can operate in discontinuous current mode tDCMl or continuous current mode (CCl 
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Design constraints 

following are the design criteria to operate the resonant converters on the utility line 
high pf and low line current harmonic distortion. 

25max should be chosen to minimize the inductor rms current and also the kVA/kW 
ating of the resonant tank circuit. Higher Qs max minimizes inductor rms current, but 
ncreases its size. 

)epending on the type of control used for regulating the output voltage from full 
oad to reduced load, the range of variation in (a) switching frequency required in 
;ase of variable frequency DCM or CCM operation, or (b) phase shift (pulse width) 
equired in case of fixed-frequency CCM operation, must be minimized while choosing 
he Cs/Ct ratio to account for variation in input voltage in addition to the required 
;ain M. 

file switching frequency (or pulse width) should be chosen such that the resonant 
;onverter operates only in the modes for which it is designed for full load near the 
)eak of the supply voltage. 

rhe resonant converter generates the required gain at least from 30° to 150° (as most 
)f the total output power is delivered in this range) even without active control over 
he 60 Hz (or 50 Hz) ac half cycle, to get lower line current distortion. 

Discontinuous current mode operation 

converter is designed to operate in just continuous current mode (JCCM) (figure 2a) 
the rated minimum peak of ac line voltage with rated maximum load current and 
peration will be in DCM (figure 2b) for the rest of line cycle. The output voltage is 
iated by decreasing the switching frequency such that the converter operates in DCM. 
figure 2a when SI and S2 are turned on, resonant current ji (normalized i^) starts 
ng with positive polarity of the voltage Vab applied to the tank circuit. During interval- 
mct (normalized Vct) = 0 and the load is supplied by the resonant current until 
) = J{t) (normalized load current assumed to be constant over one switching half 
0 to enter interval-A. In interval-A, the resonant current in excess of instantaneous load 
;nt 7(0 charges the parallel capacitor Q. The gating pulse to the switch is removed, 
the diodes D1 and D2 start conducting the resonant current in interval-A due to 
•sal in current polarity. The diodes D1 and D2 turn off with zero current at the end 
terval-C2. During the additional interval-E (dead gap) shown in figure 2b, the input 
ly to the tank circuit is cut off as all the switches are in the OFF state. It must be noted 
(he series capacitor voltage remains constant during interval-E (same as the voltage 
e end of interval-C2), until the other pair of switches S3 and S4 are turned ON. The 
don of interval-E increases with decrease in load current. The operation of converter is 
iar for the other half cycle, where switches S3 and S4 are turned ON, except for change 
ilarity in voltages and currents. Note that, all the high frequency rectifier diodes in the 
ut section are in conduction during intervals-Cl and C2. 

the output section, the filter components Ld and Cd carry approximately 120 Hz cur- 
and voltage ripple respectively, under high power factor operation. The filter inductor 


amplitude in p.u. 
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Figure 2. Normalized steady-state operating waveforms obtained from PROMATLAB for 
SPRC operation in DCM. (a) Just CCM operation, (b) DCM operation. 
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Interval -C1 & C2 Interval - A Interval - E 

Figure 3. Equivalent circuit models for SPRC operating in DCM. 


to filter only HF ripple (2/^, where is the switching frequency), whereas, Cd 
liter the 120 Hz ripple. 

models required to analyse the converter during different intervals of operation 
ied as interval-Cl, A, C2, and E) for the waveforms of figure 2 are shown in 
>. In view of low switching current ripple in idit) of the output section, constant 
model has been used. Based on the above equivalent circuit models, the converter 
:n analysed using the state-space approach (it is complex due to four intervals 
ation and three state variables) (Belaguli 1996). If active control is used to get 
dal line current, the variation in switching frequency over the 60 Hz a c half cycle 
given instantaneous M{t) and Qs (0> have been evaluated using the converter gain 
n. Figure 4 shows the plot of Qs(t), M(t), and ys(t) over the 60 Hz ac half cycle, 



igure 4. Variation of series <2 j (0^ required converter gain M(t), and normalized switch- 
g frequency ratio ys to get sinusoidal line current over ac half cycle for minimum ac 
)ltage and rated maximum load conditions. 























for two different values of <2^ max and Cs/Ct ratio. However, if no active control is used, 
then it is necessary to properly choose the converter gain M — VQlVm and Qs max at the 
peak of the 60 Hz sinusoidal ac voltage cycle for a given CsjCt ratio. 

From the analysis, it was found that, for a given peak converter gain of Vj/ = 0.25, 
good compromised design values are Qsmsx = 3 and S^max = 3.5 for Cs/Ct ratios of 
3 and 4 respectively, with the converter operating in JCCM. But simulations showed that 
a capacitor ratio of 4 is preferable over 3, due to lower harmonic distortion and lower 
inverter output peak currents. 

3.4a Design example: A converter is to be designed with the following specifications. 

Average output power, = 150W; 

Input voltage. Vac = 120 to 135 V rms; 

Output voltage, Vq = 48 V; 

Full load current, /^ = 3.125 A; 

Output current ripple in id, At = ±10% of 4; 

Output voltage ripple, = ±0.5% of \4; 

Series resonant frequency, fsr = 135 kHz. 

The design calculations are done for SPRC delivering a peak power of 2Po by choosing 
the Qs at the peak of the ac line cycle (JCCM operation at this point) for rated minimum 
input voltage, i.e.. Vac — 120 V. 

Using M = 0.25, 14n = x 120, V'o = MVm = 42.5 V. 

R'j^ = V'^/i/lPo) — 6.02 Q, ys = fs/fsr = 0*4, transformer turns ratio nil — 
0.885A 

Using the definitions of Qs max and ys, the following component values are obtained for 
Qs max == 3.5 and C^/C/ = 4: 

L = 24.84 |jlH, Cs = 0.0559 |jlF, Q = 0.0139 fxF. 

The output filter Ld and Cd are designed using the relationship given by Schutten et al 
(1991) and Belaguli (1996) to meet the output ripple requirements. 

Ld = Vo/OnfsIoAi) = 150 p,H, Cd = lo/C'^OizVoA^) = 11,512|jlF. 

3.4b SPICE simulation results: The 150 W, 48 V output converter designed above has 
been simulated in SPICE to verify its performance. Due to storage limitations, SPICE 
simulation studies have been done for a converter redesigned with a switching frequency 
of 25 kHz, which does not change the operating principle and the actual results. Figure 5 
shows the various waveforms obtained from SPICE simulation for the DCM operation of 
SPRC, for different load currents and Cs/Ct = 4. The line current waveform for full load 
shown in figure 5a has a harmonic distortion of 14%. The JCCM operation of the converter 
near the peak of ac voltage waveform is shown in figure 5b. For 50% and 10% of the rated 
load, the line current waveforms are shown in figures 5c and 5d, which have distortion 
figures of 16% and 11% respectively, while the converter is operating only in the DCM 
mode. SPICE simulation studies showed that the line current THD for a capacitance ratio 
3 was higher compared to 4. The switching frequency harmonic component appearing in 
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the line current waveforms shown in figure 5 are due to insufficient HF line filtering and 
they can be filtered using appropriate LC filters on the ac side. It is also observed that with 
the choice of Cs/Ct > 4, the converter characteristics approaches that of the SRC. 

3.4c Experimental results: Based on the design presented in §3.4a, an experimental 
prototype SPRC rated at 150 W, 42.5 V output, operating on 60 Hz, 120 V utility line was 
built using IRF740 MOSFET’s in a bridge configuration and a 1:1 HF transformer. The 
SPRC converter was controlled using LD405 resonant controller. The filter components 
used were = 150 p.H and = 10, 000 (jlF. 

The various waveforms and line current harmonic spectra obtained from the prototype 
model are presented in figure 6 for different loading conditions with Cg/Ct = 4. In these 
waveforms, for reduced loads, the frequency was decreased to get the desired regulated 
output voltage keeping the on time constant. Figure 6a(i) shows the line voltage, line 
current, output filter current and output voltage waveforms, while the corresponding line 
current harmonic spectra at full load (THD = 8.73%) are shown in figure 6a(iii). The 
JCCM operation of the SPRC at full load near the peak of the ac voltage cycle is shown 
in figure 6a(ii). For 50% rated load, line current waveform is shown in figure 6b. The 
THD increased to 12.9% at 50% load and then decreased to 8.9% at 10% rated load. The 
frequency was decreased from 68.96 kHz (at full load) to 39.8 kHz (at 10% load) in an open 
loop manner, to regulate the output voltage. The resonant peak current reduced from 10 A 
at full load to 4.5 A at 10% load. The line current waveform closely resembled sine wave 
with discontinuity near the valleys for entire load range. Near the valleys, discontinuity 
observed is due to insufficient voltage drive and converter gain to meet the required load 
demand and hence the distortion. All these waveforms and results closely conform with 
the SPICE simulations. The power factor is maintained close to unity, for the entire load 
range with DCM operation of the SPRC even without active control. However with active 
control, the line power factor can be further improved by waveshaping the line current 
waveform. 

3.4d Advantages and disadvantages of DCM operation: Some of the desirable features 
or advantages of the described converter operating in DCM are as below. 

(1) Single-stage ac-io-dc power conversion without any active control with high power 
factor. 

(2) Since the converter operates in DCM, ZCS is assured for all the switches with negligible 
switching losses. Output diodes operate with ZVS. This allows very high switching 
frequency. 

(3) The capacitor C/ is placed on the secondary-side of the HF transformer to take advan¬ 
tage of the leakage inductance as part of resonant inductance. 

(4) The value of resonant inductance required is very small. The value of resonant capac¬ 
itors are also small. 

(5) The output filter inductance {Ld) is small since it has to filter twice the switching 
frequency. 
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Figure 6. Experimental waveforms for the DCM converter designed in § 3.3a. (a) At full 
load {Rl = 12 Q, Po = 150 W): (i) input voltage Vac, line current iac, filter current id, output 
voltage Vo, (ii) Vab and ii, va and vcs near the peak of utility ac voltage, (iii) line current 
harmonic spectra for iac- (b) Half load (i?/, = 24 Pg = 75 W): Vac and iac- 





(6) In the experimental converter, MOSFETs were used as the switching devices requiring 
a series by-pass diode and another anti-parallel diode. IGBTs are better suited for ZCS 
application, also use of IGBTs is expected to reduce the conduction losses. Due to 
the availability of IGBTs with fast internal diodes, series diodes are not required. 
Mos-Controlled-Thyristors (MCTs) can also be used in the converter presented. 

Some of the problems with the proposed converter are the following. 

(1) Switch peak currents are quite high. Therefore IGBTs are the better switching devices, 
but there is a limitation on the highest switching frequency. The rectifier diode voltage 
ratings are high due to peak voltage across Q. 

(2) Since variable frequency operation is used for power control, lowest frequency sets 
the limit on the size of the HF transformer. 

(3) Lossy snubbers are required. 

(4) Output filter capacitor Cd has to filter the 120 Hz ripple and, therefore, is large espe¬ 
cially for lower output voltages limiting the transient response of the converter. This 
problem exists in all single stage ac-to-dc converters with low harmonic distortion. 

3.5 Continuous current mode of operation 

The SPRC can operate in leading pf (below resonance) or lagging pf (above resonance) 
modes depending on the switching frequency (orpulse-width when fixed-frequency control 
is used), resonant component values and the load. It has been shown that operation in 
lagging p f (ZVS) mode has several advantages (use of lossless snubbers, no tum-on losses, 
use of internal diodes of MOSFETs etc.) compared to leading pf mode of operation. 

Using ac complex circuit approximate analysis, it can be shown (Steigerwald 1988; 
Bhat 1991) that the converter gain for a dc-to-dc SPRC is given by 

M = vj/= sin(V2)/[{(:rV8)(l + (Q/CO(l - . 

-H(2.y{y5 - (l/y^)})^]^'^^ per unit, (4) 

where, Qs = {L//R'^ and 5 = tt for variable frequency control. 

Figures 7a and b show the plots of the converter gain M as a function of the normalized 
switching frequency js , for capacitance ratios of 0.5 and 1, respectively, when the converter 
operates as a dc-to-dc converter with constant dc input. When operated as an ac-to-dc 
converter as explained earlier with an HF capacitor Ci, the required variation in converter 
gain M(f), switching frequency ratio ys{t) and Qsit) over the 60Hz cycle to draw nearly 
sinusoidal line current from the utility line when active control is used are plotted in figure 8. 
The sharp dip in ys curve in figure 8 is due to insufficient gain near the zero crossings at 
chosen operating point. However, if no active control is used, the operating point has to 
be chosen carefully to satisfy all the design constraints mentioned in § 3,3. For variable 
frequency CCM operation, choosing ys closer to the load-independent point on the gain 
curve reduces the range of variation in frequency required from foil load to light load, in 
addition to reduction of inverter peak current stresses. Good compromised design values 
are; igjmax = 3.2, M = V^/Vm.min = 1> = 1.153 for Cs/Ct = 0.5. Similarly for 
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Figure 7. DC Voltage gain, M for CCM operation of the SPRC. (a) Cj/C/ = 0.5, (b) C^/C, = 1. 


CsjCt = 1, the design values are M ~ 0.75, ys = 1.195. The above design values for 
the converter will ensure lagging pf mode (or ZVS) operation, in addition to generating 
the required voltage gain near the valleys of the ac voltage, while delivering rated output 
power at rated piinimum input voltage. 


3.5a Design example: A converter is to be designed with the following specifications. 
Average output power, Po = 150W, 

Input voltage. Vac = 85 to 110 Vrms, 



(a) (b) 

Figure 8. Variation of series QsV), required converter gain M(f), and normalized switch¬ 
ing frequency ratio ys(t) to get sinusoidal line current over ac half cycle for minimum ac input 
voltage and rated maximum load conditions for SPRC'operating in CCM. (a) C.,/C, = 0.5, 
(Jb)Cs/Ct = l. 
























Output voltage, Vo = 120 V, 

Output current ripple, Aj = ±20% of lo. 

Output voltage ripple, A^ = ±0.2% of V^, 

Switching frequency, fs = 50 kHz. 

The design calculations are done for SPRC delivering a peak power of 2Po by choosing 
the Qs at the peak of minimum rated line voltage. Using the relations for Qs max and 
the component values obtained are given below for the two cases. 

Case 1; Cs/Ct — 0.5, <2^ max — 3.2, ys — 1.153, M — 

V’o^MVm = 120 V,« : 1 = 1:1, = 96 L = 563.7 p,H, 

Cs = 0.0238 pF, Ct = 0.0478 jxF. 

Case!: Cs/Ct = 1, Qsm&\ = 3.2, ys = 1.195, M = 0.75, 

Vj = MVm = 90 V, /i : 1 = 0.75 : 1, = 54 L = 328.65 |jlH, 

Cs = 0.044 p.F, Ct = 0.044 pF. 


3.5b SPICE simulation results: The 150 W, 120 V output, 50 kHz converter designed 
in §3.5a (case 1) was simulated in SPICE3 to evaluate the converter performance without 
active current control. For Cg/Ct = 0.5, the harmonic distortion in the line current wave¬ 
form (figure 9a(i)) is 14.11% at full load and rated minimum input voltage. The SPRC 
operates in lagging pf mode near the peak, and leading pf mode near the zero crossings 
of the ac voltage cycle. The line current waveform for 53% load, shown figure 9a(ii) has a 
THD of 15.5%, The SPRC operated fully in lagging pf mode for decreased load currents 
due to increase in ys for regulated output. 

Similarly for a capacitance ratio of 1 and at full load, the line current THD was 19.8% 
(figure 9b), and the converter operated in discontinuous capacitor voltage mode (DCVM), 
while delivering rated output power at rated minimum input ac voltage. Since the converter 
operated in DCVM at full load and in CCVM at reduced load, the range of variation 
in switching frequency to regulate the output was larger for Cg/Ct = 1. In all these 
simulations the switching frequency fs was increased to regulate the output voltage at 
reduced loads. 


3.5c Experimental results: Based on the design presented in § 3.5a, a breadboard model 
of SPRC rated at 150W, operating on 60 Hz, 85 V to llOV utility line was built using 
IRF640 MOSFET’s in a bridge configuration and a readily available 1:1 HF transformer 
having 12 turns each. The SPRC was controlled using UC2825 PWM controller, configured 
for variable frequency operation. 

(1) Without active control - The various waveforms obtained from the prototype model 
are presented in figure 10 for different loading conditions and Cg/Ct = 0.5. The THD 
obtained for the line current waveform (full load, rated minimum input voltage) presented 
in figure lOa(i) is 13.5%. The lagging pf and leading pf operation along with the resonant 
capacitor voltage waveforms at full load, near the peak and the valleys of the ac voltage 
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Figure 9. SPICES simulation waveforms 
for 150 W (full load), 120 V output, 50 kHz 
variable frequency SPRC operating on the 
J utility line in CCM without active control 
20 = 85 V). Waveforms Vac and iac for 

(a) Cs/C[ = 0.5; (i) at full load and (ii) at 
53% load; (b) CJC, = 1: at full load. 
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Figure 10. Experimental waveforms for different load conditions for a 120 V output, vari¬ 
able frequency SPRC operating on the utility line without active control (Cj/Q = 0.5, Ld = 
500 |xH, Cd = 1000 iJtF, Vac = 85 V rms; (a) full load (Rl = 96 Q): (i) Vac and Uc, (ii) Vab 
and il, vct and ucj near the peak of ac voltage and (iii) Vat and ii, vct and vcs near 
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11. Active current control scheme block diagram for SPRC bridge (figure 1) operat- 
the utility line. 


down in figures lOa(ii) and (iii), respectively. At full load the predominant 
omponents are 5th and 7th. The THD reached a minimum of 11.9% at 53% 

5 line current shown in figure 10(b). The maximum distortion occurred at an 
requency of 54.5 kHz, at 80% load, due to over-boosting effect at all points 
: cycle. The peak current reduced from 5.4 A at full load to 3.75 A at 10% load, 
put voltage of 110 V rms, the THD reached a maximum (due to square wave- 
minimum of 34.5% and 12.8%, at full load and 53% load, respectively. For 
atput, the required variation (increase) in switching frequency for the complete 
le variation is 50 kHz to 63.93 kHz. Experimental results also confirmed the 
results for Q/Q — 1. 

active current control - In order to reduce the line current THD, variable 
LCtive current control scheme has been implemented, and the details are given 
block schematic of the active current control scheme implemented with the 
model is shown in figure 11. In order to keep the output voltage constant for 
>ut voltage and output load, and also to keep the input line current close to 
and in phase with the line voltage, the proposed control scheme is equipped 
mtrol loops, namely: 

ter voltage feedback loop: This is a slow varying loop consisting of an output 
sensing amplifier, PI compensator-1 and a multiplier. The output voltage is 
by a voltage divider (Vo^) and compared with set reference signal {Vref) using 
mpensator. The error signal Vg in combination with sinusoidal reference 1 Vacr I 
to generate the varying amplitude sinusoidal reference current ir for referencing 
er current control loop. 

er current control loop: The current controlling feedback loop is used to monitor 
ns current iac and force it to follow the mains voltage. This control loop consists 
compensator-2, with inputs as, conditioned line current waveform \iacr \ to be 
and the reference current ir. The reference current signal ir has quick control 
ine current while the output voltage error Vg has a slow control over the line 
. The result is that the dc link current varies as a rectified sinusoid and the 
; regulation is achieved by adjusting the amplitude of the voltage error signal 
ally the control voltage Vc to the UC2825 controller is generated by summation 
r ie and Vga after proper scaling and limiting circuits. 

mple experimental waveforms obtained from variable frequency active control 
5 presented in figure 12, corresponding to a capacitance ratio of 1. For rated 
nput voltage, the line current waveforms presented in figures 12a(i) and b have 









A A i tsnat ana v Jtseiaguu 


/DU 



(a)(iii) 










1 

^^SSSSj 




■ 

gm 

□ 



■Bi 


a 


■ 


^ 

u 



□ 


■ 

H 


^ i 

(20 \ 

//divV 

1 

bshi 



^fJ-S/div 






:u (2 Ay 

Idivl 








Figure 12. Experimental waveforms for a 90 V output variable frequency SPRC operating 
on the utility line with active current control (C^/C/ = \,Ld — 500 |xH, Cd = 1000 |JlF, 
Vac = 85 Vrms). (a) At full load (/?£, = 54 Q): (i) Uac and/ qc. (ii) Uafc and near the peak 
of ac voltage and (iii) Vab and ii near the valleys of ac voltage, (b) 50% load (Rl = 108 Q): 
Vac and iac 


distortion figures of 7.4% and 11.33%, at full load and 50% load respectively. As shown 
in figures a(ii) and (iii) for full load, the operating frequency near the peak and valleys 
of ac voltage are 51.28 kHz and 58.82 kHz, respectively. The converter operated with 
ZVS operation for all the switches throughout the tzc cycle with active control at full 
load. 


3.6 Fixed-frequency SPRC 


In this case, the SPRC is designed for operation with 5 = tt (equation 4) at full load with 
minimum rated supply voltage. Therefore, the design is the same as the variable frequency 
CCM case. Power control is achieved by phase shifting the gating signals to obtain a 
quasi-square-wave dab with pulse-width d. Based on the design example presented in 
§ 3.5a, the converter was simulated using SPICE (Belaguli & Bhat 1995) for variable load 
and line voltage. It was observed that a narrow variation in pulse-width is required for 
regulating the output from full load to light load. It was also verified experimentally that 
the THD of line current is reduced by active current control scheme with pulse-width being 
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changed throughout the line cycle. The major problem with fixed-frequency control is that 
the converter operates in leading pf mode for most part of the cycle. 

4. Conclusions 

Operation of resonant converters on the utility line with high pf and low harmonic distor¬ 
tion has been presented. Design examples have been presented for the variable frequency 
controlled SPRC operating in DCM and CCM. The SPICE3 simulation and experimental 
results show that, by proper converter design, one can get low line current THD and high 
pf (>0.97) with SPRC even without active control. The capacitance ratio 0.5 is preferred 
as the THD figures, the peak current stresses and range of variation in frequency from 
full load to light load are lower as compared to those figures obtained for Q/Q ratio 
1, when no active control is used. With the implementation of active control scheme, the 
pf is maintained close to unity (>0.99) with further reduction in THD. For active cur¬ 
rent control, capacitance ratio 1 is recommended as the THD figures are lower (8% at 
full load). Slightly higher current stresses obtained for C^/C? ratio 1 are due to DCVM 
operation and as a consequence require larger variation in frequency to regulate the out¬ 
put voltage. The experimental converter built had a switching frequency of 50 kHz at full 
load and this was used only to demonstrate the high pf, low THD that can be obtained 
with variable frequency operation of SPRC. However higher switching frequency can be 
used with active control scheme, as ZVS operation is maintained over the entire 60 Hz ac 
cycle. 

References 

Belaguli V 1996 Series-parallel and parallel-series resonant converters operating on the utility 
line - analysis, design, simulation and experimental results. Ph D dissertation, Department of 
Electrical and Computer Engineering, University of Victoria, Canada 
Belaguli V, Bhat A K S 1995 Characteristics of fixed frequency series-parallel resonant converter 
operating on the utility line with and without active control. IEEE Power Electronics and Drive 
Systems (PEDS ’95) Conference Record, pp 168-173 
Bhat A K S 1991 A unified approach for the steady-state analysis of resonant converters. IEEE 
Trans, onind. Electron. 38: 251-259 

Chambers D 1983 A new high frequency resonant technique for dynamic correction of off-line 
converter input current waveforms. Proceedings ofPowercon 10, paper # F-1, pp 1-7 
He J, Mohan N 1987 Input-current shaping in line-rectification by resonant converters. IEEE 
Industry Applications Society Conference Records, pp 990-995 
Kataoka T, Mizumachi K, Miyairi T1979 A pulse-width controlled AC to DC converter to improve 
power factor and waveform of AC line current. IEEE Trans. Ind. Appl. 15: 670-675 
Keraluwala M H, Steigerwald R L, Gurumoorthy R 1991 A fast-response high power factor 
converter with a single power stage. IEEE Power Electronics Specialists Conference Record, 
pp 769-779 

Kocher M J, Steigerwald R L 1983 An ac to dc converter with high quality input waveforms. 
IEEE Trans. Ind. Appl. 19: 586-599 

Nijhof E B G1986 Resonant power supply (RPS) converters: The solution for mains/line pollution 
problems. Proc. Power Conversion Int. Conf. Rec. pp 104-139 



752 


AKS Bhat and V Belaguli 


Schlecht M F, Miwa B A 1987 Active power factor correction for switching power supplies. IEEE 
Trans. Power Electron. PE-2: 273-281 

Schutten M J, Steigerwald R L, Keraluwala M H 1991 Characteristics of load resonant converters 
in a high power factor mode. IEEE Applied Power Electronics Conference Record, pp 5-16 
Steigerwald R L 1988 A comparison of half-bridge resonant converter topologies. IEEE Trans. 
Power Electron. 3: 174-182 



Sadhana, Vol. 22, Part 6, December 1997, pp. 753-780. © Printed in India. 


Single phase power factor correction - A review 
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University of Singapore, 10 Kent Ridge Crescent, Singapore 119260 
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Abstract. This paper provides a comprehensive review of the past and re¬ 
cent developments in the area of single-phase power factor correction (PFC) 
techniques. The motivation for the research in this area, and the manifold direc¬ 
tions into which the research has gained impetus, are clearly brought out. The 
various PFC techniques are broadly classified into (1) passive, (2) active, and 
(3) active-passive PFC techniques. The active PFC techniques, based on the 
output dynamics, are further classified into (1) conventional techniques which 
have slow output dynamics and (2) techniques with fast output dynamics. The 
critical issues within each PFC technique are discussed in detail. An extensive 
list of references is also provided at the end. 

Keywords. Power factor correction; single-phase; agency standards; lEC. 


1. Introduction 

Most applications requiring ac-dc power converters need the output dc voltage to be 
well regulated with good steady-state and transient performance. The circuitry typically 
favoured until recently (diode rectifier—capacitor filter) for the utility interface is cost 
effective, but it severely deteriorates the quality of the utility supply thereby affecting the 
performance of other loads connected to it besides causing other well-known problems. In 
order to maintain the quality of the utility supply, several national and international agencies 
have started imposing standards and recommendations for electronic equipment connected 
to the utility. Since the mid-1980’s power electronics engineers have been developing new 
approaches for better utility interface, to meet these standards. These new circuits have 

been collectively called (PFC) circuits. 

Reducing the input current harmonics to meet the agency standards implies improve¬ 
ment of power factor as well. For this reason the publications reported in tMs area have 
used “Power factor correction methods”, and, “Harmonic elimination/reduction methods 
almost interchangeably. Several techniques for PFC and harmonic reduction have been 
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reported and a few of them have gained greater acceptance over the others. Commer¬ 
cial IC manufacturers have introduced control ICs in the market for the more popular 
techniques. 

In this paper, we examine and review the developments in the field of single-phase PFC 
techniques. Section 2 covers briefly the background information in this area. Following 
this, § 3 outlines a few passive PFC techniques. This is then followed by § 4 which 
summarises and discusses various recent developments in the field of active PFC. Section 
5 then presents a few active-passive PFC techniques and their merits. 

2. Background 

2.1 Conventional single phase ac-dc utility interface 

Conventionally, the utility interface of a low power single phase off-line ac-dc converter 
typically consists of a simple uncontrolled rectifier feeding a bulky filter capacitor (see 
figure la). An input tt filter, not shown in the figure, is usually present in the ac side in 
commercial products to reduce the electromagnetic interference (EMI) due to the converter. 
The bulk capacitor, C/n, is designed to maintain the ripple in the bus to an acceptable 
level and also to meet the “hold-up time” requirements. The circuit draws narrow input 
current pulses around the line voltage peaks (see figure lb). 


2.1a Power factor: Power factor (PF), is defined as the ratio of the average power to 
the apparent power, 

average power Vs hi cos Oi hi ^ ... 

apparent power Vsh h 

Power factor reflects how effectively the given source power is utilised by the load. In (1), 
it has been assumed that the input voltage is sinusoidal with low distortion. Here, Vs is the 
rms input voltage, h is the rms input current, /^i is the rms value of fundamental input 
current, and 0] is the phase angle of the fundamental current. 
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Table 1. Topical input current harmonics of the single-phase 
rectifier-capacitor type interface. 


n 3 

5 

7 

9 

11 

13 

15 

17 

{hn/hi)% 87 

80 

78 

70 

63 

50 

41 

30 


Note; Even harmonics are zero in magnitude 


The ratio ls\/ h is the distortion factor, its value reflects the effect due to input current 
irmonics. The cosine of the angle 6i is the displacement power factor, its value reflects 
e conventional power factor when there is no distortion. 


lb PF and harmonic components of a conventional interface: The narrow current 
ilses drawn by the conventional rectifier-capacitor type interface is rich in harmonics. The 
pical harmonic amplitudes of such an interface is listed in table 1 as a percentage of the 
ndamental The third, fifth, seventh, ninth and the eleventh harmonics, are considerable 
magnitude. 

The displacement factor being close to unity, the PF in a conventional interface is strongly 
iked to the distortion factor. The PF of operation is also quite poor (typically 0.60). 


2 Utility issues and agency standards 

2a Problems of conventional interface: The large harmonic content (see table 1) and 
e consequent poor PF of operation of the conventional rectifier-capacitor type interface 
luses several problems to the utility supply. Some of them are listed below. References 
-4] discuss these in greater detail. 

) Due to harmonic components - Because of the non-zero source impedance in the 
ility supply, the harmonic currents flowing through it will cause the voltage waveform to 
1 distorted at the point of common coupling to other loads. This may cause malfunction 
' other loads and also of power system protection and metering devices. Besides voltage 
aveform distortion, harmonic components can also cause the following problems. 

) Overheating of the neutral line. 

,) Interference with communication and control signals. 

') Overvoltages due to resonance conditions. 

•) Overheating of the distribution transformer and distribution lines. 


i) Due to poor PF - Poor power factor of operation implies ineffective use of the 
)lt-ampere ratings of the utility equipment such as transformers, distribution lines and 
merators. Also, it places a restriction on the total equipment load that can be connected 

1 o rv-f-fir-f* \i7a11_r»liirr M/ith maYimilTn rm V f'nTTPTlt ratmC 
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2.2b Agency standards: Due to the proliferation of power electronic equipment con¬ 
nected to the utility system, the concern over the pollution of the utility supply has been 
growing stronger over the years^”'®. It has been reported that 5% of the utility generated 
power in USA is consumed by desk-top computers alone In order to facilitate the de¬ 
livery of quality power to end-users and also to utilise the existing power generation and 
distribution equipment more effectively, several national and international agencies have 
started imposing strict standards that specify the extent of harmonic pollution that can be 
tolerated in the line current. One such standard (International Electrotechnical Commission 
IEC-555-2 or EN-1000-3-2) is expected to be implemented in Europe shortly Countries 
like USA and Japan are also showing strong inclination to impose similar standards. 

2.3 Desirable features of a RFC technique 
Input side features: 

(1) Sinusoidal input current with close to unity PF operation. 

(2) Reduced EMI. 

(3) Insensitive to small signal perturbations in the load (i.e., good output-to-input suscep¬ 
tibility figure). 

Output side features: 

(1) Good line and load regulation. 

(2) Low output voltage ripple. 

(3) Fast output dynamics (i.e., high bandwidth). 

(4) Multiple output voltage, levels if needed by the application. 

Others: Electrical 

(1) Galvanic isolation between input and output. 

(2) High power conversion efficiency. 

(3) Hold-up time if required. 

(4) Universal input voltage operation (85 V-270 V ac rms). 

Mechanical 


(1) Low part count. 

(2) Smaller size and weight. 
Economical/environmental 


(1) Low cost. 

(2) Power save feature (green supply). 



Figure 2. Conventional rectifier cir¬ 
cuit with inductive filter. 


3. Passive PFC techniques 

The power line disturbances caused by the proliferation of phase controlled and diode 
rectifier circuits were of concern even in late 70s The definition of power factor 
for nonlinear circuits and passive techniques for improving it are presented in an early 
literature ^ ^. Currently, passive techniques remain attractive for low power PFC applica¬ 
tions It has been reported that power factor as high as 0.98 can be achieved using 
passive PFC techniques. The following sub-sections discuss a few of the passive PFC 
arrangements. 

3.1 Inductive filter 

Figure 2 shows a well-known scheme with an inductor inserted between the output 
of the rectifier and the capacitor. The inclusion of the inductor results in larger conduction 
angle of the current pulse and reduced peak and rms values. 

For low values of inductance the input current is discontinuous and pulsating. Typical PF 
achieved in discontinuous mode operation (DCM), with practical values for the inductor, 
is in the range of 0.65 to 0.75. Better power factor (PF) is achievable by using a larger 
value of the inductance and pushing the operation to continuous conduction mode (CCM). 
However, it is shown ^ ^ that even for infinite value of the inductance, the PF cannot exceed 
0.9 for this arrangement. 

The inductor may also be introduced on the ac side The position of the inductance will 
not affect the PF in DCM operation. Under CCM operation, however, the circuit behaviour 
itself will be different depending upon the location of the inductance. For instance, the 
presence of an infinite inductance on the a c side (i.e. CCM operation) will result in zero 
input current and zero voltage across the bulk capacitor, theoretically. However, the same 
inductance on the dc side will result in rectangular blocks of current in the input with the 
bulk capacitor charging up to the average value of the rectified input voltage. 

For lower power levels, the distribution line inductance will itself act as a good filter. 
For an office plug point (15 A), the line inductance is typically in the range of 1 to 4 mH. 
An estimate of this value can be obtained from the assumption that the ac side reactance 
Xs (= coLs) is 5% of the ratio of nominal rated voltage to the maximum current rating 
of the plug point A practical value for the line impedance seen at a particular wall-plug 
outlet, may also be obtained using the method suggested 
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Figure 3. Rectifier circuit with input capacitor Cj. 


In the scheme shown in figure 3, a small filter capacitor Q is connected across the input 
terminals of the circuit. The line inductance (not shownin figure 3) and Cs forms the first 
stage LC filter. Therefore higher order harmonics of the line frequency will undergo greater 
attenuation (typically 80 dB) resulting in better harmonic performance. It is reported 
that even for relatively small values of the inductance, a PF of 0.86 is attainable, which is 
a considerable improvement over the no-capacitance case. 

3.2 Resonant input filter 

Figure 4 shows the series filter arrangement for power factor correction ^ 3-i4 ^ which results 
in good power factors as high as 0.94. Thus, harmonic performance is also good. However, 
the power factor depends upon the resonant quality factor which is load dependent. Here 
the bandpass filter is designed with a centre frequency equal to the supply frequency. The 
quality factor “Q” determines the bandwidth and hence the harmonic content of the supply 
current. High “Q” (narrow bandwidth) will result in reduced harmonic content and close 
to unity power factor. This circuit arrangement is popularly used in applications where the 
supply frequency is high. One such application is the space platform 13-14^ supply 
frequency up to 25 kHz. 

Some authors suggest the use of parallel resonant filter (see figure 5) for PF im¬ 
provement. With this arrangement power factor close to 0.95 is achieved. The filter is tuned 
to offer a very high impedance to the third harmonic component (the most predominant). 
The high value parallel resistor is added to damp out circuit oscillations. 



Figure 4. Rectifier circuit with series resonant filter. 
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Figure 5. Conventional rectifier circuit with parallel resonant filter. 


3.3 Evaluation of passive techniques 

Passive filters may prove to be a good solution for switch mode power supplies 
(SMPS) operating at lower power levels (<200 W). Typical applications are TVs, VCRs, 
other home entertainment electronics, and perhaps some of the lower powered office au¬ 
tomation equipment such as computers, FAX machines etc.. It is to be noted that in these 
applications the real aim of the filter is to contain the harmonics within the agency specified 
limits ^ and not improvement of power factor per se At such power levels the passive 
solution is economical and more efficient and may also result in size benefits. How¬ 
ever, even for low power levels, there are a few factors that make the passive solution less 
attractive than the active solution in certain cases. The factors are as below. 


Universal input voltage range (85 V to 280 V ac, 47 to 63 Hz): It is desirable for both the 
manufacturers and the users that the equipment works in all these supply conditions with¬ 
out any modifications. This requirement is difficult to meet using passive PFC technique 
whereas it is an inherent feature in many active PFC techniques. 


Optimal use of the wall outlet power: Passive PFC technique results in lower PF than 
active PFC technique. Therefore, passive PFC technique is less attractive when priority is 
given to optimising the use of the wall plug’s volt-ampere capacity rather than just meeting 
the agency standards. Optimising the wall plug’s VA capacity allows a user to plug more 
equipment to the same wall outlet. 

For higher power levels, however, the passive solution suffers from disadvantages such 
as: 

(1) Large size of the reactive elements 

(2) May not be able to contain the harmonics within the agency specified limits (the agency 
requirements are different for different power levels ^). 

(3) Poor power factor compared with active schemes. 

(4) Not cost effective. 

However, the oassive solutions are simple to understand and implement, besides being 








The active PFC technique, which involves the shaping of the line current using switching 
devices such as MOSFETs (metal oxide semiconductor field effect transistors) and IGBTs 
(insulated gate bipolar junction transistors) is a result of advances in power semiconductor 
devices and microelectronics. For low and medium power ranges up to a few kilowatts 
(<5kW), MOSFETs are by far the popular choice for PFC because of their switching 
speed, ease of driving and ruggedness. BJTs and more recently IGBTs are used for high 
voltage medium power applications which MOSFETs are unable to contend with owing 
to their large on-state resistances. 

For achieving good input current waveshaping using active techniques, typically the 
switching frequency should be at least an order of magnitude greater than 3 kHz (= 50 x 
60 Hz = 50th harmonic of line frequency). With modem advances in MOSFETs and 
IGBTs, this is feasible. 

The use of active PFC techniques results in one or more of the following advantages. 

• Lower harmonic content in the input current compared to the passive techniques. 

• Reduced rms current rating of the output filter capacitor. 

• Near unity power factor (0.99) is possible to achieve with the Total Harmonic Distortion 
(THD) as low as 3-5%. 

• For higher power levels active PFC techniques will result in size, weight and cost benefits 
over passive PFC techniques. 

The following sub-sections present the recent advances in single phase active PFC tech¬ 
niques. The active PFC techniques have been classified into two broad categories in this 
survey. 

(1) Active PFC techniques with poor load dynamics. These have been referred to in this 
paper as “Conventional active PFC techniques” (§4.1). They are typically followed 
by a dc-dc downstream converter which caters to the demands of the load. 

(2) Active PFC techniques with fast load dynamics (§ 4.2). Here, the PFC unit is capable 
of meeting the fast dynamic requirement of a typical load. 

4.1 Conventional active PFC techniques 

Here, there are basically two approaches. One approach is to use current-source-type cir¬ 
cuit , in which the PFC acts as a current source feeding the load. Using the other ap¬ 
proach results in the well known voltage-source-type circuits discussed 20-23,25-46,52-54^ 
57-68,70-72,75-95,102 though the voltage source type circuits are more popular, the current 
source type circuits are useful in certain niche applications. In the following subsections 
both types of circuits and schemes are discussed. 

4.1a Topologies: The current-source-type PFC converter is usually of buck type. 
However, for the voltage-source-type PFC converter, any one of the basic dc to dc converter 
switching cells, such as buck, boost, buck-boost and Cuk converter, can be used. Among 



re 6. Circuit variations of the boost PFC topology. Single-switch (a), two-switch (b), four- 
h (c), and half-bridge (d) boost PFC’s. 


boost and the buck-boost topologies are more popular. In the following sub- 
le notable features of these topologies (both current and voltage source types) 
ti critical issues related to these topologies are presented. 

t topology - In figure 6, some of the popular boost circuits are shown. Several 
L the boost converter such as the position of the inductor on the input side (reduced 
ent ripple and EMI), and a. dc voltage gain greater than unity, make it a natural 
■ active PFC application The single-ended boost converter (figure 6a) is more 
le to its use of a single active switch and also due to the ease of driving the switch, 
odes of operation are possible for the single-ended boost converter topology. 

nuous conduction mode (CCM) 
laryofCCMandDCM^'^. 
ntinuous conduction mode (DCM) 

M: This is a very popular choice for medium and high power applications, 
e supply ripple current due to switching is low because of the presence of the 
;t input inductor. Hence, the input filtering requirement is relatively low. Isolated 
" boost PFC converter is also available The following issues are to be noted 
i to the boost converter operating in CCM. 

^.covery loss: The reverse recovery loss of the output diode is a serious problem 
gle-ended boost converter operating in CCM The output diode should 
everse blocking voltage equal to the output voltage, which for the universal input 
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This reverse recovery of the diode also increases the tum-on losses in the boost switch. This 
factor places a limitation on the switching frequency. Hence, the size of the unit cannot be 
reduced beyond a certain limit. The reverse recovery also causes increased EMI 


Charge dumping loss: Another factor that limits the switching frequency is the parasitic 
output capacitor (typically around 150 pF) of the boost switch, usually a power MOSFET. 
The capacitor’s energy is lost in the channel of the switch during tum-on; the resulting 
power loss could be significant at high switching frequencies. 


EMI performance: In a conventional off-line converter (figure 1), the line current is nar¬ 
row and peaky. However, the waveform is still smooth with very little switching component 
owing to the presence of a bulky capacitor, Cin, at the output of the bridge rectifier feeding 
the dc-dc converter (see figure la). As a result, the input EMI filter size can be small. In the 
case of boost PFC converter, however, the switching frequency components are directly 
fed into the supply lines. Hence, a large EMI filter is required in spite of the presence of 
the boost inductor, L. The issues related to EMI filtering are presented ’^3,74^ 


Cusp distortion: The single-ended boost converter (figure 6a) operating in CCM suffers 
from line current distortion near the zero crossings 43,54 jg because the reference 
current slope at zero crossings is higher than the slope of the charging current of the 
boost inductor which defines the maximum current rate at which the input current can be 
increased. The effect of this “cusp” distortion is to introduce fine current harmonics. In 
DCM operation, cusp distortion may be lower, as only a small value of inductor will be 
used. 

One or more of the foregoing problems may be addressed by using loss-less switching 
techniques. By turning the switch on during the instants when the voltage across it is zero 
(zero voltage switching - ZVS) the problem of loss due to charge dumping is solved. ZVS 
also helps to reduce the EMI. In converters using resonant techniques to realise ZVS, 
even though switching losses are minimised, conduction losses and switch voltage stresses 
are higher compared to those using PWM control. This problem, however, is less severe in 
converters using PWM soft switching techniques 1^3-105 105^ ^ switching 

technique for a boost PFC circuit. 


Boost in CCM - DCM boundary: The converter operating at the boundary of CCM and 
DCM is also a popular PFC technique used mainly for low power applications such a 
electronic ballasts for lighting The operation of the converter in CCM-DCM boundary 
eliminates the problems due to reverse recovery of the output diode. However, the charge 
dumping loss is not prevented. The high value of the switching currents places a severe 
stress on the switch, diodes and the output capacitor. The input filtering requirements are 
also high. It is this factor that limits its application to low power levels. 


Boost DCM: As a front-end PFC converter in a cascaded scheme (§ 4.3a), the boost 
circuit operating in DCM mode is seldom used, as it offers no special advantage over that 
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Figure 7. Flyback PFC converter. 


jrating in the boundary of CCM and DCM. However, operation in DCM is popular 
single-stage PFC implementations (see § 4.3c) that use boost circuit as front part of 
converter. Unlike the flyback converter discussed in the following paragraphs, a boost 
iverter operating in DCM does not yield a sinusoidal input current (neglecting the 
itching components) if the duty ratio is held constant over a line cycle. 

Buck-boost or flyback topology - The flyback topology (figure 7) is attractive for low 
ver applications. 

The start-up inrush current problem faced by the boost topology is not present here. 

The implementation of the overload protection is simple compared with the boost 
topology. 

The output voltage may be greater or less than the peak input voltage. 

Implementation of galvanic isolation between the input and the output is simple with 
flyback topology. 

Both CCM and DCM modes are possible for flyback PFC converter But in either 
ide the input current is chopped resulting in more noise and EMI than with a compa- 
ile boost topology. However, the DCM operation eliminates the diode reverse recovery 
)blem. 

Ihe DCM operation of the flyback is popular because of the simplicity of control, 
lusoidal input current (after filtering the switching component) is drawn automatically 
he switch duty cycle is maintained constant over one line half cycle 
Watson et al ^®3,104 ^ flyback topology as shown in figure 8. Here, the switch S2 

ith its anti-parallel diode) together with clamp capacitor Cc forms an active-snubber 
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Figure 9. Power circuit of the half-bridge 
buck converter. 


network. Due to this, the energy stored in the transformer leakage inductance Lin is not 
dissipated; instead it is recycled while minimising turn-off voltage stress across the power 
switch SI. Furthermore, the ZVS of switch SI lowers output rectifier di/At resulting in 
reduced rectifier switching loss and output switching noise. 


f Hi) Buck topology - In voltage source type buck circuits, the input voltage must be 
greater than the output voltage. Due to this, during certain intervals when the input voltage 
is less than the output voltage, no current will be drawn from the input; hence, the net 
input current will be nonsinusoidal. But the advantage of the buck converter is that it can 
provide current limit support due to the series buck switch 

In the application reported in references the input voltage varies over a very wide 
range (130 to 600 V) but the output voltage desired is 400 V. Thus, the boost converter alone 
is not suitable. So, a topology with (Buck + Boost) operation is adopted to maintain the 
input current sinusoidal. When the input voltage is greater than 400 V, the buck operation 
is effective. Below 400 V, the boost operation is performed. 

Unlike the voltage-source-type buck converter, the current-source-type buck PFC con¬ 
verter operates for all values of the input voltage drawing a sinusoidal current. To achieve 
this, however, the instantaneous output current has to be maintained at a higher value than 
the instantaneous input current. 

A three-phase buck converter operating as a current-source-type PFC converter is also 
reported The application of the idea to a single-phase case is also mentioned 
References deal with the single-phase implementation itself. Reference discusses 
a two-switch current source type PFC circuit (figure 9) which has been derived from a half- 
bridge voltage source type PFC converter based on the duality principle. Note that, 
in these converters, a diode must be added in series with the switch(es) to realise bipolar 
blocking capability; this results in additional conduction power loss. The current-source- 
type buck PFC converter is particularly useful in application where the output voltage must 
be reduced to low values, such as in a dc motor control or ac-dc rectifier with over current 
limit. 


4.1b Input current control techniques used in conventional PFC converters: (i) Peak 
current mode control —In this constant frequency control method, the switch (peak) current 
is sensed and made to track the desired sinusoidal reference current The current 

reference is obtained by multiplying the error signal of the output voltage control loop with 
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; sinusoidal template obtained from the input voltage. The peak current mode control 
sasy to implement and has an inherent fast current limit protection. Several integrated 
cuits such as ML4812 have been developed to implement this control. 

The peak current mode control technique is not suitable for the buck and the flyback 
C topologies due to the large error between the switch peak current and the average 
)ut current which is to be controlled. The control method is however suitable for a boost 
nverter, though there will be a “peak to average” error in this topology also. 

The peak current mode control technique suffers from duty cycle dependent instabil- 
(“subharmonic oscillation”). In a dc-dc converter, this problem can be eliminated by 
Dviding “external ramp compensation” In case of an ac-dc PFC converter, external 
np compensation is not very effective as the operating point, which is line and load 
pendent, varies over a wide range. Furthermore, providing excess ramp compensation 
ids to input current distortion, particularly around zero-crossings of the input voltage. 

) Average current mode control - By controlling the average input current instead of the 
ak 20,54,55,51^ subharmonic oscillation problem is avoided. Good input current wave 
aping is achieved as the average line current is directly controlled. Due to the averaging 
ter used, the dynamic performance of the current control loop is not fast, which however 
not seen as a major limitation in PFC applications. Several commercial integrated circuits 
ch as UC3854 are available for the implementation of this control for boost PFC 
nverter operating in CCM. 

i) Charge control - In charge control, the average input current is controlled on a cycle- 
-cycle basis giving rise to fast current loop response. Though this technique is general, 
is particularly useful for flyback PFC^^’^^ where the use of average current control may 
it give the desired performance. As with peak current mode control, the charge control 
:hnique is also reported to exhibit subharmonic oscillation 

;) Hysteresis current control (HCC) - In this control method, the input current is made 
switch within a reference current window called the hysteresis band 30>32,34,45,46,94 
lere are many variations in this technique such as constant hysteresis band, and variable 
^steresis band The variable frequency operation of this method is seen as one of its 
sadvantages as the energy storage elements including input filter may have to be sized 
r the lowest frequency of operation. HCC technique, however, offers several advantages 
ch as ease of implementation, and fast and robust current control. 

) Sinusoidal pulse width modulation control - Here, just as in inverter applications, a 
angular carrier-wave is modulated with a sinusoidal modulating signal to generate the 
ive pulses for the switch(es) Closed loop regulation is achieved by varying the depth 
■ modulation. 

i) Delta modulation control (DMC) - This technique which is the dual of the HCC 
ethod, is usually used with current-source-type ac-dc PFC converter (see figure 9). 
ere, the capacitor (Cy) voltage is made to switch within a predetermined band about the 
ference sinusoidal voltage. For a given sinusoidal input voltage, the input current then 
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automatically becomes a sinusoid. The power factor of the input current can be adjusted 
to any value by appropriately adjusting the phase of the reference voltage. However, DMC 
is parameter sensitive and relatively complex leading to the proposal of the next control 
technique 

(vii) Inductor voltage control (IVC) - In this technique the input inductor (Ly) 
voltage (figure 9) and hence the input current is controlled to follow a sinusoidal refer¬ 
ence voltage, resulting in a significant performance improvement over the DMT, such as 
robustness, simplicity and direct input current control. 

4.1c Modelling of RFC converters: (i) Small signal modelling of conventional RFC 
circuits-In most PFC applications, the output voltage of the PFC is dynamically regulated 
against load and line variations, using closed loop control. In order to design a good 
compensator circuit based on analysis, linear small-signal models are necessary for the 
power and control circuits. These models are dependent on the topology and the control 
technique used. 

Mohan et al present the small signal model for the boost converter operating in CCM 
with the load assumed to be resistive. However, in most cases, the load of a conventional 
PFC would be of constant power type with negative impedance characteristics. The small 
signal model for the boost converter with this type of load is also discussed 25,57,67,68 _ 

(ii) Large signal modelling of RFC converters - The large signal “averaged” model 
can be used to obtain several steady-state results of the system easily. Besides steady- 
state analysis, the averaged model is very useful in efficient simulation (using SPICE, 
SABER etc.) of complex systems such as the PFC circuit. Both simulation time and the 
memory storage requirement for the output are significantly less than in the simulation 
using actual switches. Due to these, unlike with a switched model, the averaged model 
simulation can be easily run several times to optimise converter and controller performance. 
There are a variety of approaches to obtain an averaged model. These are dealt with in 
detail in other references . The averaged model concept for PFC application is used 
extensively 46,5 1 ,93,94 ^ 

The “switched” model with ideal switches, despite its drawbacks such as long simulation 
time and large secondary storage space, is still useful and necessary in the study of switching 
transients and in the determination of quantities such as peak voltage and current stress 
of the device, duty ratio and switching frequency variations. Thus, the averaged and the 
switched models complement each other in their usefulness. 

4.2 Rroblems in conventional active RFC techniques 

All conventional PFC techniques discussed in § 4.1 suffer from some drawbacks which 
are discussed in this section. These form the motivation for the “Active PFC techniques 
with fast load dynamics” discussed in § 4.3. 




Figure 10. Input-output properties of a PFC converter. 


)us input and output power must be equal. With sinusoidal input current and voltage, 
instantaneous input power is a sine-squared function (figure 11) consisting of a dc 
ver component plus an ac component with frequency at twice the line frequency. At the 
put port, the output voltage is regulated to a constant value and hence the load draws 
istant (dc) power only. Due to this, there must be a bulk energy storage element (such as 
in figure 10) to absorb the second harmonic input power. The amplitude of the second 
monic power is the same as the dc value delivered to the load, hence requiring a large 
)acitor. 

rhe second harmonic energy can also be stored in an inductor such as in current-source- 
le PFC converters. However, with the present technology, the size and cost of a capacitor 
generally much lower than a corresponding inductor. This is the main reason for the 
isent popularity of voltage-source PFC converters over current-source PFC converters. 

:b Slow output dynamics: A typical closed loop control system for a PFC is shown 
figure 12. In order to have fast dynamic response for sudden changes in line voltage 
i load, the bandwidth of the loop must be high. But, as the magnitude of the second 
monic voltage component at the output is quite high (even with large Cb), the control 
nal Vc (figure 12) will then have a large second harmonic component. However, in 
ier to achieve a sinusoidal input current it is essential that Vc be slow varying and be 
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Figure 11. Input and output power waveforms. 




Figure 12. Block schematic of a typical closed loop system for PFC converter. 


maintained constant at least over one half of the line cycle. Due to this design conflict, the 
requirement of fast dynamics is usually sacrificed in order to achieve good input current 
waveform. The typical bandwidth of the outer voltage loop will be of the order of 5-20 Hz, 
resulting in slow output dynamics. 


Note: The response for line voltage variations, however, is usually good due to a input 
voltage feed-forward that is typically adopted 

4.3 Active PFC techniques with fast load dynamics 

As pointed out in § 4.2, due to the existence of the second harmonic problem in conven¬ 
tional active PFC techniques, the output voltage-control-loop will have a poor response. 
Approaches to overcome this limitation are discussed in the following sections. 


4.3a Cascaded scheme: This scheme is currently very popular. Here, the conven¬ 
tional PFC is cascaded with a down stream dc-dc converter (see figure 13). The downstream 
converter addresses the load requirements such as providing fast dynamics, tight regula¬ 
tion etc., and the PFC converter meets the line requirements, thus ensuring independent 
control over the input and output requirements. Also, the resulting PFC arrangement is 
modular in construction. Several ICs are available, ML 4819 for example, with the PFC 
and DC/DC converter control functions integrated. With all its advantages, the cascaded 
scheme still suffers from the following drawbacks. 

(1) The rated output power is handled twice resulting in poor efficiency. 

(2) Having separate full power rated modules for the PFC and the downstream dc-dc 
converter entails higher cost, weight and size. 



Figure 13. Power flow in a cascaded 
scheme. 












Figure 14. A single-stage PFC scheme - Boost integrated with flyback rectifier/energy 
storage dc-dc converter (BIFRED). 


3b Second-harmonic filtering scheme: The dynamics of conventional PFC converters 
n be improved by filtering-out the second harmonic component of line frequency from 
tering the error amplifier stage of the control loop by using a notch filter The 
'or amplifier can now be designed to have a wide bandwidth and high-gain for the 
v frequency range resulting in fast response. However, with the use of this technique, 

I corrective action will be taken by the loop for perturbations at around twice the line 
jquency. It must be noted that, this technique does not prevent the second harmonic of 
5 line frequency from reaching the load; it only avoids it from reaching the control loop. 


5c Single-stage PFC schemes (S^PFC): A variety of single-stage schemes featur- 
y tight output regulation and near-sinusoidal input current with power factor close to 
ity have been proposed Most single stage schemes, such as the BIFRED^^’^^, 
BRED DITHER are realised by combining the two power stages of a cascaded 
beme into a single power stage allowing the active switch(es) to be shared (figure 14). 
5cause of the sharing of the switches, the control freedom is greatly reduced. Also, the 
Itage and current stresses of the active switches are typically much higher than those of 
e switches in the cascaded scheme, thereby resulting in less efficiency. Often variable 
jquency PWM techniques '79,82,83 qj. resonant techniques are employed in S^PFC 
heme. 

In many variable frequency PWM techniques, two control variables (say switch on-time 
d frequency) are still used for meeting the line and the load requirements independently, 
van et al however, report a single-stage scheme operating with fixed frequency PWM 
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Figure 15. Input and output power 
waveforms to illustrate P^PFC concept. 









technique. Here, a two-switch forward converter is integrated with the boost PFC. One of 

the switches is shared by both the converters. 

Generally, single-stage schemes suffer from one or more of the following drawbacks. 

(1) Suffers from all the drawbacks of DCM operation of conventional PFCs. 

(2) In many cases, the voltage across the bulk energy storage capacitor is uncontrolled 
and can reach high values. A higher rated capacitor will result in increased cost and 
greater power loss due to larger ESR (Equivalent Series Resistance) values. 

(3) The frequency varies in many cases over a wide range (typically 1; 8) making the EMI 
filter design difficult. 

(4) Increased stress on devices. 

(5) Efficiency of power conversion is generally low. 

(6) Complex control. 


4.3d Partially parallel PFC schemes (P^ PFC): In figure 15, the input and outputpower 
waveforms are divided into three (time) zones. In zone-2, the input power processed by the 
PFC is greater than the output power demand. In a conventional PFC scheme, this excess 
energy flows into the output bulk capacitor or inductor. In the P^PFC scheme, however, 
this excess power is diverted to a bulk capacitor (or a bulk inductor) within the system. 

During zone-1 and zone-3, the input power processed by the PFC is less than the output 
power demand. Therefore, in order to meet the load power requirement, the excess energy 
stored previously in zone-2 is now released to load (through another power conversion). 
The sum of energy deficits during zones 1 & 3 is equal in value to the excess energy stored 
within the system during zone-2. 

It can be shown that the average value of the excess power during zone-2 is equal to 
32% of the output power Pq. Thus, in this scheme, 68% of the output power is directly fed 
to the load after one-time processing by the PFC stage. The remaining 32% is however 



Figure 17. Power flow in a feed back type of 
P^PFC scheme-2. 
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■ocessed twice before it is fed to the load. As a result, the overall efficiency of such a 
heme can be expected to be higher than that of the cascaded scheme. The challenge, 
5re, lies in designing an appropriate control scheme that ensures fast output dynamics 
hile enabling the input current to be a sinusoid with PF close to unity. 

Figures 16 & 17 show two possible power flow structures that can be used to implement 
e P^PFC concept. Several circuit implementations are possible to realise each of the 
)wer flow structures. In the.circuit (figure 18) proposed the ac-dc PFC stage is 
iplemented using a flyback converter operating in DCM. The dc-dc converter stage is 
iplemented using a buck converter. The flyback converter delivers 68% of the output 
)wer directly to the load. The remaining 32% is handled once by the flyback PFC stage 
id for the second time by the dc-dc converter stage before it is fed to the load. A few 
her circuit implementations conforming to the P^PFC power flow stmcture are also 
ported past output dynamics is achieved by appropriately controlling the PFC 
age and the dc-dc converter stage. 

In figure 19, a circuit based on flyback topology that implements P^PFC scheme-2 is 
lown , Here, the PFC stage feeds the rated power to the load, but part of the output 
)wer (32%) is fed back to itself via Cb to be processed again during zones 1 and 3. 


3e Output shunt PFC scheme (OSPFC): This scheme is referred to as Reactive 
lunt Regulator Scheme and is shown in figures 20 and 21. Here, the second harmonic 
)wer processed by the PFC stage is diverted to the dc-dc converter stage and the dc power 
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Figure 20. Power flow in OSPFC scheme. 

is fed to the load. It is easy to see that in this arrangement, the dc-dc stage processes 64% 
(i.e. 2 X 32%) of Pq while the PFC stage processes full power Pp. Hence, the improvement 
in efficiency over the cascaded scheme may not be significant. 

4.3f Input-shunt PFC schemes (ISPFC): Unlike the OSPFC scheme, the shunt con¬ 
verter here is an ac-dc bi-directional converter and it is located at the ac input side. The 
literature reports two approaches for the ISPFC scheme which are discussed in the follow¬ 
ing paragraphs. 

(i) Scheme-1 - This scheme (figure 22) has been in use for several years in large power 
systems for harmonic elimination ^0-64 However, this can be successfully used for cor¬ 
recting the power factor of small systems also^"^. The off-line converter in the main power 
flow path draws the well-known nonsinusoidal current (see figure lb). In order for the net 
input current to be sinusoidal, the ac-dc shunt PFC stage is made to draw a current which 
is the difference of the desired sinusoidal input current and the off-line converter’s input 
current. It must be noted that the PFC stage does not process any active power; it only 
handles the harmonic power. Because of the high value of the peak input current drawn 
by the off-line converters, the ac-dc PFC stage should be designed to handle high current 
stress. 

(ii) Scheme-2 - This scheme 9^,106^ consists of a cascaded scheme paralleled at the 
input with an off-line converter (figure 23). With this arrangement, the net input current is 
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Figure 22. 


Input-shunt PFC scheme-1 


controlled in such a way that it is sinusoidal and in phase with the input voltage. Therefore, 
(as in scheme-1) the current drawn by the PFC stage will be the difference of the sinusoidal 
input current and the narrow and peaky current drawn by the rectifier-capacitor front- 
end. Again, as in scheme-1, the ac-dc PFC converter here handles bi-directional power 
flow. However, as it also processes active power, the peak current stress here is relatively 
low ^4,106 Pqj. ^ certain range of power sharing between the two parallel paths, this scheme 
may result in a better efficiency than the cascaded scheme ^4>106 

At the output, the dc-dc converters #1 and #2 may each be supplying independent loads 
(figure 23a), and may each have multiple output voltage levels. However, for the case 
when there is only one load, the outputs of the two dc-dc converters may be paralleled 
to feed the load (figure 23b). For ensuring stable parallel operation, current-mode control 
technique may be used in the dc-dc converters. The sharing of the load power may also 
be programmed using this control technique. 


4.3g Fully parallel PFC scheme (FPPFC): A fully parallel PFC scheme is proposed 
in which a PFC stage is paralleled, both at the input and the output, with a conventional 
off-line ac-dc converter with rectifier-capacitor front-end (figure 24). The power flow is 
shared between 6? c-^/c converter and ac-dcPFC stage. Thus, the overall power is processed 
once only. Due to this arrangement, the net input current is nonsinusoidal; though the PFC 
stage draws a sinusoidal current, the off-line converter draws the well-known narrow and 
spiky currents. The main aim in this scheme is to only contain the line current harmonics 
within the agency specified limits rather than achieving a high power factor. At the output, 
fast dynamics is achieved by using the off-line converter to cancel the second harmonic 
component injected by the PFC stage. Here again, it is shown that a 68% power sharing by 
the PFC stage and 32% power sharing by the off-line converter results in good performance. 
This technique may prove to be economical for lower power levels. 


ConvenHonal off-line converter ConvenUonel off-line converter 
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Figure 23. Input-shunt PFC scheme-2 (ISPFC-2). (a) Independent loads, (b) Outputs paralleled. 







Figure 24. Block diagram of a fully parallel 
scheme (FPPFC). 


5. Active-passive PFC techniques 

As the name indicates, active-passive PFC involves the combination of active and pas¬ 
sive techniques for PFC/harmonic reduction. The main motivation for this technique is 
improvement of efficiency (over the cascaded scheme) and possible size and cost reduction. 

The input currents of the active-passive schemes reported so far^®~^^, are not pure 
sinusoids. The attained PF though low (0.75 to 0.9) is, however, significantly higher than 
the conventional off-line converter with rectifier-capacitor type interface. Active-passive 
scheme can be used for applications where the main aim is to contain the line current 
harmonics within the agency specified limits rather than achieving a high power factor. 
This technique may prove to be economical for lower power levels. 

In the scheme by Elmore et a conventional boost PFC is provided with a bypass 
passive path consisting of the boost inductor, boost output diode and the output capacitor. 
The boost switch is not operated around the line voltage peaks; during this interval, the 
passive path conducts. A simple open loop control is adopted which results in large 
output voltage variations and uncontrolled power sharing between the two parallel paths. 

Oruganti & Thean suggest a loop control scheme, in which both the output voltage 
and the power sharing are directly controlled. The control scheme, however, is somewhat 
complex. The merits and demerits of the active passive scheme is given below. 

(1) The diode reverse recovery loss problem, occurring in a boost converter, which is 
maximum around the line voltage peaks is now reduced. Hence, efficiency is improved. 
Or else, for a given efficiency, the switching frequency can be pushed higher to achieve 
cost and size reduction. 

(2) Reduced EMI as the line current wave form does not have any switching current 
component around the peaks. 

(3) Input current is non-sinusoidal and hence higher harmonic content. Power factor is 
also poor when compared with the conventional PFC. 

(4) Does not alleviate the problem of slow output dynamics. Thus the active-passive PFC 
stage must be further cascaded with a dc-dc converter. 

(5) Not suitable for universal input voltage range as the output voltage varies with ac input 
voltage magnitude. 

6- Conclusions 

As seen in the paper, the research activities in the area of single-phase PFC can be identified 
as falling under three categories: (1) Passive PFC technique, (2) active PFC technique, and 
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(3) active-passive PFC technique. Passive PFC techniques result in significant improve¬ 
ment in PF and harmonic performance compared to the rectifier-capacitor type interface. 
However, in general, these are not superior compared to what is achievable using active 
PFC techniques. It is stated in many publications that passive PFC technique does not offer 
cost, size, and weight benefits. However, such a statement is very general. This is because, 
if the main aim is to contain the line current harmonics within the agency specified limits 
and not improvement of PF per se, then the passive PFC technique may prove to be a better 
choice for lower power levels (<200 W). A designer must investigate this aspect further 
for his/her specific application. 

The work in active PFC technique was classified in two ways, in this paper; (1) Con¬ 
ventional active PFC (poor output dynamics); (2) Active PFC with fast output dynamics. 

The conventional PFC circuits interface well with the utility supply in terms of high 
PF (near unity) and reduced harmonics. However, they cannot be used to feed loads that 
require fast output dynamics. Several alternative PFC schemes to overcome this problem, 
such as cascaded, S^PFC, P^PFC, OSPFC, ISPFC, and FPPFC have been reported. In all 
these the primary aim of the researchers has been to achieve one or more of advantages 
such as (1) improved performance, (2) higher efficiency and (3) cost, size and weight 
benefits. 

The work reported in active-passive PFC schemes is not extensive. Schemes have been 
reported in which an active PFC path is paralleled with a passive rectifier to realise several 
advantages such as increased efficiency, reduced EMI, and increased switching frequency. 

References 

[1] J S Subjak, J S McQuilkin 1990 Harmonics - Causes, effects, measurements, and analysis: 
An update. IEEE Trans. Ind. Appl. IA-26: 1034-1042 

[2] R P Stratford 1980 Rectifier harmonics in power systems. IEEE Trans. Ind. Appl. Vol. 
IA-16: 271-276 

[3] W Shepherd, P Zand 1979 Energy flow and power factor in nonsinusoidal circuits 
(London: Cambridge University Press) 

[4] N Mohan, T M Undeland, W P Robbins 1994 Power electronics: Converters, applica¬ 
tions, and design (New York; John Wiley & Sons) 

[5] Solid state. IEEE Spectrum, January 1994, p. 50 

[6] International Electrotechnical Commission Sub-Committee Ilk 1992 Disturbances in sup¬ 
ply systems caused by household appliances and similar electrical equipment. Draft Revi¬ 
sion of lEC Publication 555.2 

[7] T S Key, Jih-Sheng Lai 1993 Comparison of standards and power supply design options 
for limiting harmonic distortion in power systems. IEEE Trans. Ind. Appl. 29: 688-695 

[8] J Klein, M K Nalbant 1990 Power factor correction - Incentives, standards and techniques. 
Power Conversion Intell. Motion (June): 26-30 

[9] D Maliniak 1992 DC supply keeps line power factor near unity. Electron. Design (May 
28): 33-34 

rim !/■ TAr*,i 100*1 Arr»i»^onriao ViormrMtir'o frr\m ial/ar*trrMiir» <:»nntnmf^nf 


776 


Ramesh Oruganti and Ramesh Srinivasan 


[12] SB Dewan 1981 Optimum input and output filters for single phase rectifier power supply. 
IEEE Trans. Ind. Appl. IA-17; 282-288 

[13] Vorpe’rian, R Ridley 1990 A simple scheme for unity power-factor rectification for high 
frequency AC buses. IEEE Trans. Power Electron. 5: pp 77-87 

[14] P Jain 1994 A unity power factor resonant AC/DC converter for high frequency space 
power distribution system. IEEE Power Electron. Spec. Conf. Rec. 1: 

[15] A R Prasad, P D Ziogas, S Manias 1990 A novel passive waveshaping method for single 
phase diode rectifiers. IEEE Trans. Ind. Electron. IE-37; 521-530 

[16] K Yamashita 1994 Harmonics fighters pursue choke-coil, one converter power supplies. 
Nikkei Electron. Asia (August): 44-47 

[17] Jih-Sheng Lai, D Hurst, T Key 1991 Switch mode power supply power factor improvement 
viaharmonic elimination methods./£££A/7p/. Power Electron. Conf., 7?^c.,pp 415-422 

[18] A R Prasad, P D Ziogas, S Manias 1988 A comparative evaluation of SMR converters with 
and without active input waveshaping. IEEE Trans. Ind. Electron. IE-35(3); (August) 
461-468 

[19] R Redl, L Balogh 1995 Power-factor correction in bridge and voltage-doubler rectifier 
circuits with inductors and capacitors. IEEE Appl. Power Electron. Conf, Rec., pp 
466-472 

[20] N Mohan, T M Undeland, R J Ferraro 1984 Sinusoidal line current rectification with a 
100 kHz B-SIT step-up converter. IEEE Power Electron. Spec. Conf, Rec.: 92-98 

[21] C P Henze, N Mohan 1986 A digitally controlled ac-dc power conditioner that draws 
sinusoidal input current. IEEE Power Electron. Spec. Conf, Rec. : 531-540 

[22] M F Schlecht, B A Miwa 1987 Active power factor correction for switching power supplies. 
IEEE Trans. Power Electron. PE-2: 273-281 

[23] M F Schlecht 1986 Harmonic-free utility/DC power conditioning interfaces. IEEE Trans. 
Power Electron. PE-1: 231-239 

[24] R Omganti, Y T Cham 1993 Power factor correction techniques in off-line switching power 
supplies. J. Inst. Eng. Singapore 33: 27-35 

[25] L H Dixon 1990 High power factor switching preregulator design optimization. Unitrode 
Power Supply Design Seminar Manual, SEM 700, pp 13-1 to 13-12 

[26] J LoCascio, M Nalabant 1990 Active power factor correction using a flyback topology. 
Power Conversion Intell. Motion, Conf, Rec. 

[27] B Andreycak, C H Yearn, J A O’Connor 1991 UC3852 controlled on-time zero current 
switched power factor correction preregulator. Design Guide (Preliminary), Application 
note U-132, Unitrode Power Supply Design Seminar Manual, SEM 800 

[28] P N Enjeti, R Martinez 1993 A high performance single phase AC to DC rectifier with 
input power factor conection./£££ App/. Power Electron. Conf, Rec.,p^ 190-195 

[29] E X Yang, Y M Jiang, G C Hua, F C Lee 1993 Isolated Boost Circuit for Power Factor 
Correction. IEEE Appl. Power Electron. Conf, Rec., pp 196-203 

[30] M S Dawande, G K Dubey 1995 Bang bang current control with predecided switching 
frequency for switch mode rectifiers. IEEE Power Electron. Drives Syst. Conf, Rec., 
pp 538-542 

[31] MS Dawande, G K Dubey 1993 Programmable input power factor correction method for 
switch mode rectifiers. IEEE Appl. Power Electron. Conf, Rec. 

[32] C Zhou, R B Ridley, F C Lee 1990 Design and analysis of hysteretic boost power factor 
correction circuit. IEEE Power Electron. Spec. Conf, Rec. 

[33] C Zhou, M M Jovanovic’ 1992 Design trade-offs in continuous current-mode controlled 
boost power factor- correction circuits. Proceedings of the High Freq. Power Conversion 



Single phase power factor correction — A review 


111 


[34] J J Spangler, A K Behera 1993 A comparison between hysteretic and fixed frequency boost 
converters used for power factor correction. IEEE Appl. Power Electron. Conf., Rec., 

pp 281-286 

[35] W Tang, F C Lee, R B Ridley, I Cohen 1992 Charge control: Modelling, analysis and 
design. IEEE Power Electron. Spec. Conf, Rec. 

[36] W Tang, Y M Jiang, G C Hua, F C Lee, I Cohen 1993 Power factor correction with 
flyback converter employing charge control. IEEE Appl Power Electron. Conf, Rec., 
pp 293-298 

[37] H Endo, T Yamashita, T Sugiura 1992 A high power factor buck converter. IEEE Power 
Electron. Spec. Conf, Rec. 

[38] Y M Jiang, F C Lee 1994 A new control scheme for buck + boost power factor correction 
circuit. Volume V of the Virginia Power Electronic Center publication series 

[39] M C Ghanem, K Al-Haddad, G Roy 1993 A new single-phase buck-boost converter with 
unity power factor. IEEE Ind. Appl Soc. Annu. Meeting, Rec., pp 785-792 

[40] M C Ghanem, K Al-Haddad, G Roy 1996 A new control strategy to achieve sinusoidal line 
current in a cascade buck-boost converter. IEEE Trans. Ind. Electron. 43: 441-449 

[41] D Maksimovic, R Erickson 1995 Universal-input, high-power-factor, boost doubler recti¬ 
fiers. IEEE Appl Power Electron. Conf, Rec., pp 459-465 

[42] A H Mitwalli, S B Leeb, G C Verghese, V J Thottuvelil 1996 An adaptive digital controller 
for a unity power factor converter. IEEE Trans. Power Electron. 11(2): 374-382 

[43] J C Salomon 1993 Techniques for minimising the input current distortion of cuixent- 
controlled single-phase boost rectifiers. IEEE Trans. Power Electron. 8: 509-520 

[44] J C Salomon 1993 Circuit topologies for single-phase voltage-doubler boost rectifiers. 
IEEE Trans. Power Electron. 8: 521-529 

[45] J T Boys, A W Green 1989 Current-forced single phase reversible rectifier. Inst. Elec. 
Eng. Proc. B136: 205-211 

[46] R Srinivasan, R Oruganti 1997 Analysis and design of single phase power factor correction 
with half bridge boost topology. IEEE Appl. Power Electron. Conf, Rec., 1: 489-499 
(Also sent to IEEE Trans. Power Electron.) 

[47] Bakari M M Mwinyiwiwa, P M Birks, Boon-Teck Ooi 1992 Delta-modulated buck-type 
PWM converter. IEEE Trans. Ind. Appl 28: 552-557 

[48] Boon-Teck Ooi, Bakari M M Mwinyiwiwa, X Wang, G Joos 1991 Operating limits of the 
current-regulated delta-modulated current-source PWM rectifier. IEEE Trans. Ind. Appl. 

38:268-274 

[49] S Funabiki, N Toita, A Mechi 1991 A single-phase PWM AC to DC converter with a step 
up/down voltage and sinusoidal source current. IEEE Ind. Appl. Soc., Annu. Mtg., Rec., 
pp 1017-1022 

[50] R Oruganti, M Palaniapan 1996 Inductor voltage controlled variable power factor buck 
type ac-dc converter. IEEE Power Electron. Spec. Conf, Rec., pp 230-237 

[51 ] R Srinivasan, M Palaniapan, R Oruganti 1997 A single phase two-switch buck type AC-DC 
converter topology with inductor voltage control. IEEE Power Electron. Spec. Conf., 
Rec. pp 556-563 (Also sent to IEEE Trans. Power Electron.) 

[52] M J Kocher, R L Steigerwald 1983 An AC-to-DC converter with high quality input wave¬ 
forms. IEEE Trans. Ind. Appl IA-19: 586-599 

[53] L Balogh, R Redl 1993 Power-factor correction with interleaved boost converters in continuous- 
inductor-current mode. IEEE Appl Power Electron. Conf, Rec., pp 168—174 

[54] P C Todd 1993-94 UC3854 controlled power factor coixection circuit design. Unitrode - 
Product Applications Handbook, Lexington, MA 

1551 Mirrn 1 .inpnr Data Bonk 1993 


[56] Vnitrode Applications Handbook 1987-88 Lexington, MA 

[57] L Dixon 1991 Average current mode control of switching power supplies. Unitrode Power 
Supply Design Seminar Manual. SEM 800 

[58] H Matsuo, N Aoike, F Kurokawa, A Hisako 1994 A new combined voltage-resonant inverter 
with high power factor and low distortion factor. IEEE Power Electron. Spec. Conf., Rec. 

1; 

[59] A Ferrari de Souza, I Barbi 1994 A new ZVS-PWM unity power factor rectifier with 
reduced conduction losses. IEEE Power Electron. Spec. Conf., Rec. 1: 

[60] Gyu-Ha Choe, Min-Ho Park 1988 A new injection method for ac harmonic elimination by 
active power filter. IEEE Trans. Ind. Electron. 35: 141-147 

[61] J Nastran, R Cajhen, M Seliger, P Jereb 1994 Active power filter for non-linear ac loads. 
IEEE Trans. Power Electron. 9; 92-96 

[62] H Akagi 1994 Trends in active power line conditioners. IEEE Trans. Power Electron. 9: 
263-268 

[63] H-L Jou, J-C Wu, H-Y Chu 1994 New single-phase active power filter. lEE Proc. Electron. 
Power Appl. 141: 129-134 

[64] D A Torrey, Adel MAM, Al-Zamel 1995 Single-phase active power filters for multiple 
non-linear loads. IEEE Trans. Power Electron. 10; 263-272 

[65] C A Canesin, I Barbi 1996 Analysis and design of constant-frequency peak-current- 
controlled high- power-factor boost rectifier with slope compensation. IEEE Appl. Power 
Electron. Conf, Rec., pp 807-813 

[66] J Lazar, S Cuk 1996 Feedback loop analysis for ac/dc rectifiers operating in discontinuous 
conduction mode. IEEE Appl. Power Electron. Conf, Rec., pp 797-806 

[67] R B Ridley 1994 Average small-signal analysis of the boost power factor correction circuit. 
Volume V of the Virginia Power Electron. Center Publication Series 

[68] F A Huliehel, F C Lee, B H Cho 1992 Small-signal modelling of the single -phase boost 
high power factor converter with constant frequency control. IEEE Power Electron. Spec. 
Conf, Rec., pp 475-482 

[69] P D Ziogas 1981 The delta modulation technique in static PWM converters. IEEE Trans. 
Ind. Appl. IA-17: 199-203 

[70] M S Elmore, W A Peterson, S D Sherwood 1991 A power factor enhancement circuit. 
IEEE Appl. Power Electron. Conf, pp 407-414 

[71] R Oruganti, C Y Thean 1994 A novel PFC scheme for AC to DC converter with reduced 
losses. IEEE Ind. Electron. Conf, Rec. 

[72] M B Harris, A W Kelly, J P Rhode, M E Baran 1994 Instmmentation for measurement of 
line impedance. IEEE Appl. Power Electron. Conf, Rec., pp 887-893 

[73] Valtkovic’, D Borojevic’, F C Lee 1996 Input filter design for power factor correction 
ckcmis. IEEE Trans. Power Electron. 11; 199-205 

[74] F S Dos Reis, J Sebastia’n, J Uceda 1994 Determination of EMI emission in power factor 
preregulators by design. IEEE Power Electron. Spec. Conf, Rec. 2; 

[75] M O Eissa, S B Leeb, G C Verghese, A M Stankovic 1996 Fast controller for a unity- 
power-factor PWM rectifier. IEEE Trans. Power Electron. 11:1-6 

[76] G Spiazzi, P Mattavelli, LRossetto 1995 Powerfactor preregulators with improved dynamic 
performance. IEEE Power Electron. Spec. Conf, Rec., pp 150-156 

[77] Y Jiang 1994 Development of advanced power factor correction techniques. Ph D Disser¬ 
tation, Virginia Polytechnic Institute and State University, Blacksburg, VA 

[78] DM Divan, G Venkataramanan, C Chen 1992 A unity power factor forward converter. 
IEEE Ind. Appl. Soc., Anna. Mtg., Rec., pp 666-672 


[79] I Takahashi, R Y Igarashi 1991 A switching power supply of 99% power factor by the 
dither rectifier. IEEE Int. Telecommun. Energy. Conf., Rec., pp 714-719 

[80] M H Kheraluwala, A R Schnore, R L Steigerwald 1992 Performance characterisation of a 
high power factor power supply with a single power stage. IEEE Ind. Appl. Soc., Annu. 
Mtg, Rec., pp 659-665 

[81] I Takahashi, R Y Igarashi 1992 High power factor switching regulator with no rush current. 
IEEE Ind. Appl. Soc., Annu. Mtg., Rec., pp 673-680 

[82] R Redl, L Balogh, N O Sokal 1994 A new family of single-stage isolated power-factor 
correctors with fast regulation of the output voltage. IEEE Power Electron. Spec. Conf., 
Rec. 2: 1137-1144 

[83] M Madigan, R Erickson, E Ismail 1992 Integrated high quality rectifier-regulators. IEEE 
Power Electron. Spec. Conf, Rec.,^p 1043-1051 

[84] R Redl, L Balogh 1992 RMS, DC, peak, and harmonic currents in high-frequency power- 
factor correctors with capacitive energy storage. IEEE catalog No. 0-7803-0485-3/92, pp 
533-540 


[85] R Redl, L Balogh 1995 Design considerations for single-stage isolated power-factor- 
corrected power supplies with fast regulation of the output voltage. IEEE Appl. Power 
Electron. Conf, Rec., pp 454-458 

[86] M J Willers, M G Egan, J M D Murphy, S Daly 1994 A BIFRED converter with a wide 
load range. IEEE Ind. Electron. Conf, Rec., pp 226-231 

[87] M M Javanovic, D M C Tsang, F C Lee 1994 Reduction of voltage stress in integrated high- 
quality rectifier-regulators by variable-frequency control. IEEE Appl. Power Electron. 
Conf, Rec., pp 569-575 

[88] Y S Lee, K W Siu 1996 Single-switch fast response switching regulators with unity power 
factor. IEEE Appl. Power Electron. Conf, Rec., pp 791-796 

[89] M Brkovic, S Cuk 1995 A novel single-stage AC-DC full-bridge converter with magnetic 
amplifiers for input current shaping. IEEE Power Electron. Spec. Conf, Rec., pp 990- 


995 

[90] Y Jiang, F C Lee, G Hua, W Tang 1993 A novel single phase power factor correction 
scheme. IEEE Appl. Power Electron. Conf, Rec., pp 287-292 

[91] Y Jiang, F C Lee 1994 Single-stage single-phase parallel power factor correction scheme. 
IEEE Power Electron. Spec. Conf, Rec. 2: 

[92] J P Gegner, C-Y Hung, C Q Lee 1994 High power factor ac-to-dc converter using a reactive 
shunt regulator. IEEE Power Electron. Spec. Conf, Rec. 1. 349-355 

[93] R Srinivasan, R Oruganti 1996 A single phase parallel power processing scheme with power 
factor control. Int. J. Electron. 80:291-306 (Also in IEEE Power Electron. Drive Syst. 
Conf, Rec. T. 1995) 

[94] R Srinivasan, R Oruganti 1997 A single phase parallel power processing scheme wi* 
input-shunt power factor correction stage. IEEE Power Electron. Drive Syst. Conf, 


[95] W-I Tsai, Y-Y Sun, W-S Shieh 1991 Modelling and control of single phase switching mode 
rectifiers with near-optimum dynamic regulation. IEEE Ind. Electron. Conf, Rec., pp 

[96] B R Needham, P H Eckerling, K Siri 1994 Simulation of large distnbuted DC power 
system using averaged modelling techniques and the SABER simulator. IEEE Appl. Power 
Electron. Conf, Rec., pp 801-807 

[97] D Kimhi, S Ben-Yaakov 1991 SPICE model for cu^nt mode PWM converters operating 
under continuous inductor current conditions. IEEE Trans. Power Electron. 6. 281-286 




780 


Ramesh Oruganti and Ramesh Srinivasan 


[98] K Mahabir, G Verghese, J Thottuvelil, A Heyman 1990 Linear averaged and sampled data j 
models for large signal control of high power factor AC-DC converters. IEEE Power 
Electron. Spec. Conf., Rec., pp 372-381 

[99] S R Sanders, J M Noworolski, X Z Liu, G C Verghese 1991 Generalised averaging method 
for power conversion circuits. IEEE Trans. Power Electron. 6: 251-259 

[100] J Thottuvelil, D Chin, G C Verghese 1991 Hierarchical approaches to modelling high- 
power-factor AC-DC converters. IEEE Trans. Power Electron. 6: 179-187 

[101] P T BCrein, J Bentsman, R M Bass, B L Lesieutre 1990 On the use of averaging for the 
analysis of power electronic systems. IEEE Trans. Power Electron. 5: 182-190 

[102] L Dixon 1989 High power factor preregulators for olf-line switching power supplies. Uni- 
trode Power Supply Design Seminar Manual, SEM 600 

[103] R Watson, G C Hua, F C Lee 1994 Characterisation of an active clamp flyback topology for 
dc-dc conversion and power factor correction alications. IEEE Appl. Power Electron. 
Conf., Rec., pp 412^18 

[104] R Watson, F C Lee, G C Hua 1994 Utilisation of an active-clamp circuit to achieve soft 
switching in flyback converters. IEEE Power Electron. Spec. Conf. Rec. 

[105] G C Hua, C S Leu, Y M Jiang, F C Lee 1992 Novel zero-voltage-transition PWM converters. 
IEEE Power Electron. Spec. Conf, Rec., pp 55-61 

[106] R Srinivasan 1998 Single phase power factor correction - investigation and develop¬ 
ment of advanced techniques. Ph D thesis. National University of Singapore, Singapore 



Flexible AC transmission systems: A status review 


K R PADIYAR and A M KULKARNI 

Department of Electrical Engineering, Indian Institute of Science, 

Bangalore, India 560012 
e-mail; krpyar@ee.iisc.emet.in 

Abstract. With the availability of high power semiconductor switches with 
turn-off capability, voltage source converter based controllers for power trans¬ 
mission system applications have become a reality. Prototypes of some second 
generation Flexible AC Transmission System (FACTS) controllers like TCSC 
and STATCON have been installed. This paper presents a review of the progress 
in FACTS. A generalized description of FACTS controllers is also presented. 

Keywords. Flexible AC transmission systems; static condenser; unified power 
flow controller; thyristor controlled series compensation. 


1. Introduction 

Expansion in power transmission networks has taken place not only due to the increase 
in generation and loads but also due to the extensive interconnection among different 
power utilities. The major factor for system interconnection is the economy possible in the 
reduced generation reserves for achieving the same level of reliability of supply. 

Except for the limited number of HVDC links in a system, the vast majority of trans¬ 
mission lines are AC. The power flows in AC lines are uncontrolled and are determined 
by Kirchhoff’s laws. This is in contrast to HVDC links where the power flow has to be 
regulated by converter controls. The lack of control in AC networks can be considered an 
advantage from the point of view of avoiding additional equipment. However AC lines 
have the following disadvantages. 

(1) Power flow in parallel paths is determined according to their reactance. As a first 
approximation, the power flow in AC networks can be compared to the power flow 
in DC resistive networks where the resistance is analogous to the reactance. The 
operation of KVL (Kirchhoff’s voltage law) implies that the network is often not 
optimally utilized. 

(2) Power flow in AC lines (except short lines of lengths below 150 km) is limited by 
stability considerations. This implies that the lines may operate normally only at power 
levels much below their thermal limits. 
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(3) The operation of KVL and lack of control in AC lines implies that the normal power 
flow in a line is kept much below the peak value which itself is limited by stability (as 
mentioned earlier). This margin (or reserve) is required to maintain system security 
under contingency conditions. 

(4) The AC transmission network requires dynamic reactive power control to maintain 
satisfactory voltage profile under varying load conditions and transient disturbances. 

(5) AC lines, while providing synchronizing torque for oscillating generator rotors may 
contribute negative damping torque which results in undamped power oscillations, 
(particularly with fast acting static exciters and high gain automatic voltage regulators) - 

(6) The increase in load levels is accompanied by higher reactive power consumption in 
the line reactances. In case of mismatch in the reactive power balance in the system, 
this can result in voltage instability and collapse. 

The reactive power compensation of AC lines using fixed series or shunt capacitors 
can solve some of the problems associated with AC networks. However the slow nature 
of control using mechanical switches (circuit breakers) and limits on the frequency of 
switching imply that faster dynamic controls are required to overcome the problems of AC 
transmission networks. Recent developments involving deregulation and restructuring of 
the power industry are aimed at isolating the supply of electrical energy (a product) from 
the seiwice, involving transmission from generating stations to load centres. This approach 
is feasible only if the operation of AC transmission lines is made flexible by introducing 
fast-acting high-power solid state controllers using thyristor or GTO valves (switches). 
The advent of high voltage and high power thyristor valves and digital controllers in 
HVDC transmission has demonstrated the viability of deploying such controllers for power 
transmission. Thyristor controllers were also utilized in the late seventies to control current 
in reactors and switch capacitors and this led to the development of Static Var Compensators 
(SVC). 

Flexible AC Transmission System (FACTS) is a concept proposed by Hingorani 
(1988, 1991, 1993) that involves the application of high power electronic controllers 
in AC transmission networks which enable fast and reliable control of power flows and 
voltages. FACTS do not indicate a particular controller but a host of controllers which 
the system planner can choose, based on cost benefit analysis. The objectives are as 
below. 

(1) Regulation of power flows in prescribed transmission routes. 

(2) Secure loading of fines nearer their thermal limits. 

(3) Prevention of cascading outages by contributing to emergency control. 

(4) Damping of oscillations which can threaten security or limit the usable line capacity. 

Table 1 shows that the controllers can be broadly classified into two classes: 

(a) shunt-connected controllers providing voltage control; 

(b) series-connected controllers providing power flow control. 
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FACTS controllers. 



Type 

Main function 

Controller used 

Comments 

: var compensator) 

Shunt 

Voltage control 

Thyristor 

Variable impedance 
device 

ristor controlled 
)ensation) 

Series 

Power flow 
control 

Thyristor 

Variable impedance 
device 

yristor controlled 

Series 

Power flow 

Thyristor 

Phase control using 

5 regulator) 

and shunt 

control 


series (quadrature) 
voltage injection 

(static condenser) 

Shunt 

Voltage 

control 

GTO* 

Variable voltage 
source 

c synchronous 
Densator) 

Series 

Power flow 
control 

GTO* 

source 

Variable voltage 

Red power flow 

Shunt 
and series 

Voltage and 
power flow 
control 

GTO* 

Variable voltage 
source 


tead of the GTO, other power semiconductor devices with turn-off capability such as IGBT or 
S controlled thyristors) can also be used. 


^ provides power flow control mainly by injecting a quadrature voltage in se- 
complex power generated by the series voltage source is supplied by the shunt- 
i transformer. The simplified expression for power flow in a lossless transmission 
'en by 

P — sin(3s/; + 0) 

~ X ’ ^ 

and Vr are sending and receiving end bus voltages, X is the series reactance of 
\SR is the difference in the bus angles, 0 is the phase angle shift introduced by a 
jle regulator (phase shifting transformer). 

•vious from (1) that the control of voltage, series reactance and phase angle (0) 

^ power flow. While the control over the first two variables can be used to increase 
r limit, the control over 0 can be used to regulate power flow in loops, 
tor controllers have also been employed or proposed for the following. 

/nchronous Resonance (NGH) damping (Hingorani 1981). 

istor Controlled Dynamic Brake (Rao & Nagsarkar 1984; Raschio et al 1995). 

Current Limiting (Salama et al 1993; Sugimoto et al 1996). 

voltage Protection (Sarkozi etal 1994). 

> controllers have also been proposed in distribution systems for control of power 
.^arsen et al 1992; Hingorani 1995; Akagi 1996; Sabin & Sundaram 1996). The 
s are to limit voltage fluctuations and reduce the impact of momentary interrup- 
would affect sensitive loads. In addition, distribution type FACTS devices can be 
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In this paper, the state of the art in the development of FACTS is presented. The attention 
is focussed on the new FACTS controllers based on voltage source converters (STATCON 
and UPFC). 


2. Description of FACTS controllers 

In this section we briefly describe the basic principle of operation of some of the FACTS 
controllers. 


2.1. Static Var compensator 

SVC is an important FACTS device already widely in operation. Ratings range from 60 
MVAR to 600MVAR. SVC can be considered as a “first generation” FACTS controller 
and uses thyristor controllers. It is a shunt reactive compensation controller (Gyugyi 1979, 
1988; Miller 1982) consisting of a combination of fixed capacitor or thyristor-switched 
capacitor in conjunction with thyristor-controlled reactor (FC-TCR or TSC-TCR) (fig¬ 
ure 1). The SVC is a variable susceptance controller; the effective susceptance is varied 
by changing the conduction time of the thyristors of the TCR and/or switching in/out 
the shunt capacitor using a TSC. Early applications of the SVC were for load compen¬ 
sation of fast-changing loads like arc furnaces and steel mills for dynamic power factor 
improvement and load balancing in the three phases. Transmission line compensators are 
used not only for reactive power compensation (maintaining voltages at key points of the 
network within limits) but also for improving system stability. The control strategy usually 
employed is to use the SVC as a voltage regulator. In addition supplementary controls are 
used for damping power oscillations. The steady state control characteristics of a SVC is 
shown in figure 2. In the controllable range, a small slope is given to the characteristic to 
prevent frequent hitting of limits and also to facilitate parallel operation. 


SVC BUS 



Figure 1. TCR-TSC type SVC. 
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2.2 Thyristor-controlled series compensator 

The TCSC is a second generation FACTS controller which controls the effective line 
reactance by connecting a variable reactance in series with the line. The variable reactance 
is obtained using a FC-TCR combination with mechanically switched capacitor sections 
in series. Figure 3 shows a TCSC with two modules connected in series. Use of series 
compensation is usually a preferable alternative for increasing power flow capability of 
lines as compared to shunt compensators as the ratings required for series compensators 
are significantly smaller. Use of TCSC can reduce the adverse torsional interactions with 
generator-turbine shafts (“Subsynchronous Resonance”) which is a major concern in the 
application of series capacitors. 

The first demonstration project of TCSC was commissioned in 1991 at a 345 kV Kanawha 
River Substation in West Virginia, USA under American Electric Power Company. This 
was a test installation of thyristor switches in one phase for rapid switching of series 
capacitor segments and was supplied by Asea Brown Boveri, Sweden. 

In October 1992, the first three-phase TCSC was installed at 230 kV Kayenta Substation 
in Arizona under the Western Area Power Administration (WAPA) (Christl 1991). Here 
a 15^^ capacitor bank is connected in parallel with a TCR and permits smooth and rapid 
control of (capacitive) reactance between 15 and 60 ^2 through phase control of TCR (a 
varying between 145 to 180). 


yww 


BREAKER 
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Vs 


Vr 


rvT\r\ 

Exciting Transformer 


Avrv^ 

Boosting Transformer 


Converter 



Figure 4. Static phase shifter schematic. 


A large prototype three-phase TCSC was installed in 1993 at 500 kV Slatt Substation in 
Oregon under the Bonneville Power Administration (BPA) (Urbanek et al 1993; Piwko et 
al 1994). The project was sponsored by the Electric Power Research Institute (EPRI) and 
the equipment was developed by General Electric in U.S.A. Here, six modules of TCSC 
are connected in series and controlled to provide a variation in impedance from -f 1.4^2 to 
-16^2. 


2.3 Thyrisior-controlled phase angle regulator 

This is also known as Static Phase Shifter (SPS) and phase shift with respect to the bus 
voltage is achieved by adding or subtracting a variable voltage component in quadrature 
with the bus voltage. This variable quadrature voltage component in one phase is obtained 
by thyristor switches and by an exciting transformer which is connected between the other 
two phases. The quadrature voltage is injected in series with the transmission line by a 
boosting transformer. The basic arrangement is shown in figure 4. 

Many configurations of the SPS have been proposed (some employing GTOs). Also, 
with some configurations continuously variable phase angle of injected voltage (not nec¬ 
essarily in quadrature) is achievable. For a description of available topologies for SPS and 
a comparison between them see the review by Iravani & Maratukulam (1994). 

2.4 STATCON 

STATCON is a shunt reactive compensation device; however, unlike the SVC it is based on 
voltage source converter (VSC) using GTOs (figure 5). The principle of operation is similar 
to that of a synchronous condenser. The VSC is connected to the system through a small 
reactance which is the leakage reactance of the coupling transformer. The VSC produces 
a set of three phase voltages which are in phase with the corresponding bus voltages. The 
reactive power is varied by varying the magnitude of the converter output voltages. A small 
phase difference exists in the steady state (depending on reactive power output) so that 
real power can be drawn from the lines to compensate for the losses. The current on the 
DC side is mainly a ripple of magnitude much smaller than the AC line currents. As no 
real energy exchange (except to compensate for losses) takes place in steady state, the DC 
voltage can be maintained by a capacitor. 

The STATCON can also exchange active power with the system if it has an energy 
source at its DC terminals. In this situation, the phase of the inverter output voltages with 
respect to corresponding bus voltages is varied to obtain control over active power. 
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Figure 5. Six-pulse STATCON. 


The major advantages of the STATCON over the SVC are (Edwards et al 1988; Larsen 
al 1992); 

) The STATCON can supply required reactive current even at low values of bus volt¬ 
age, whereas the reactive current capability of SVC at its susceptance limit decreases 
linearly with decrease in bus voltage (compare figures 2 and 6). 

) With proper choice of device ratings and thermal design, STATCON can have a short 
time overload capability. This is not possible in an SVC because there is an inherent 
susceptance limit. 

) Significant size reduction can be achieved because of reduced number of passive 
components and their smaller size. 

) STATCON can allow for real power modulation if it has an energy source or large 
energy storage at its DC terminals. 

An early application of a ±20 MVAR STATCON (Sumi et al 1981) used forced com¬ 
utated thyristors. Subsequently there have been GTO based installations which include a 
1 MVAR (Edwards et al 1988) and a larger ±80 MVAR (Mori et al 1993) STATCON. A 
'ge ±100 MVAR prototype installation has been reported recently (Schauder et al 1995). 
)me of the salient features of this STATCON are as follows. 

) 48-pulse inverter (8 inverters connected in series on the AC side with appropriate phase 
shifts and in parallel on the DC side). 

) DC capacitor voltage; 6.6 kV. 



Irtat 


Figure 6. Control characteristics of 
STATCON. 
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(3) Inverter valves employ 5 GTO modules in series. 

(4) GTO rating: 4500 V, 4000 A (peak turn-off)- 

A bidirectional 18-pulse voltage source converter utilizing GTOs and rated at lOMVA 
was reported (Walker 1990). The converter is rated to operate in all four quadrants and 
connects a storage battery to a utility grid. It can be used for static Var control, as a power 
system stabilizer or as a real power peaking station. 

In addition to multipulse topology, multilevel converter topology has also been proposed 
for the STATCON (Menzies & Zhuang 1995; Ekanayaki & Jenkins 1996). However the 
optimum circuit configuration is difficult to choose because relative significance of design 
parameters will vary from application to application. 

2.5 Unified power flow controller (UPFC) 

The UPFC (Gyugyi 1992,1994,1995) is the most versatile of FACTS controllers. The main 
function of the UPFC is to control real and reactive power flow through the line by injection 
of a (variable) voltage in series with the transmission line. A schematic of the UPFC is 
shown in figure 7. The UPFC consists of 2 GTO-based inverter branches. The series branch 
consists of a voltage source inverter which injects a voltage in series through a transformer. 
Since the series branch of the UPFC can inject a voltage with variable magnitude and 
phase angle it can exchange real power with the transmission line. However the UPFC as 
a whole cannot supply or absorb real power in steady state (except for the power drawn to 
compensate for the losses) unless it has a power source at its DC terminals. Thus the shunt 
branch is required to compensate (from the system) for any real power drawn/supplied by 
the series branch and the losses. In addition the shunt branch can exchange reactive power 
independently with the system. Thus the UPFC offers three controllable parameters. If 
only the series branch of the UPFC is used, there is only one controllable parameter which 
is the magnitude of the injected voltage (the phase of the injected voltage is constrained 
to be in quadrature with the line current so that the device cannot supply or absorb real 
power). This device is called the Static Synchronous Series Compensator (SSSC). 

The world’s first demonstration of a UPFC is under installation by American Electric 
Power Co., at the utility’s Inez Station in eastern Kentucky, USA. It is to be installed in 
two phases; the ±160 MVAr shunt inverter has been installed first for voltage support in 
1997. Subsequently, the series inverter of the same rating will be installed. The converter 
will use GTOs and a three-level topology. 


PORT 1 PORT 2 

Transmission line 



Figure 7. UPFC. 
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Figure 8. Circuit representation. 


2,6 Generalized description 

Based on the treatment given by Gyugyi (1992), the influence of a FACTS controller in a 
transmission line can be viewed as injection of a shunt current source and a series voltage 
source (see figure 8). This is the most general description of a FACTS controller, such as 
a UPFC. 

Ignoring active power losses in the controller, the following constraint equation 
applies 


Re[Vi*] = Re[el% (2) 

It is assumed that both e and i are sinusoidal and can be expressed as phasors in steady state. 
Equation (2) shows that there are three independant variables (for example, magnitude and 
phase angle of e and the value of reactive component of current in i ) that can be manipulated 
to control power flows (active and reactive) in a line. 

It is to be noted that the general case, a FACTS controller may contain an energy source 
and thus even the constraint equation (2) does not apply. However this is unlikely except 
in the case of STATCON which may use a battery energy storage system for providing 
electrical energy during interruptions caused by system faults. 

Except for UPFC, most of the FACTS controllers use only single control variable as 
listed in table 2. 


Table 2. Mathematical description. 


Controller 

Constraint equations 

Control variable 

SVC 

e — 0,i — jBsvc^ 

Bsvc 

TCSC 

i =0,e = jXrcscl 

Xtcsc 

TCPAR 

Vi = el 


STATCON 

V=Ve 

e = 0, Re[Vi*] = 0 

i (reactive current) 

without energy source 
STATCON 

e = 0 

ia (active current) 
i 





3. Control of STATCON and UPFC 


3.1 STATCON control 

The primary controller of the STATCON is the reactive current (component of current in 
quadrature with the bus voltage) controller. A closed-loop control of reactive current is 
necessary because the STATCON current is dependent not only on the firing angle of the 
GTOs but also on the parameters of the rest of the system as well. 

There are two controller structures which have been proposed for reactive current control 
of STATCON (Schrouder & Mehta 1993). 

(1) Type /: The controller uses both magnitude and phase angle control of the voltage 
controllers to control the real and reactive current (figure 9). The real current is used to 
maintain capacitor at a constant voltage. The reactive current can be used as an inner 
loop of a bus voltage regulator. The real current is used to maintain capacitor voltage 
constant. 

(2) Type It Reactive current control can also be achieved by phase-angle control alone. 
The capacitor voltage is not controlled but depends upon the phase difference between 
the converter output voltages and the bus voltage (the phase difference is usually very 
small). Thus the converter voltage magnitude (which is dependent on the capacitor 
magnitude alone in this case) varies with phase angle. As a result, control over reactive 
current is possible by phase control. Control of bus voltage is achieved as in type I 
type controller (figure 10). 

PI control of STATCON (type II) poses problems due to nonlinearity in the state equa¬ 
tions. Nonlinear state feedback control is used to overcome this problem by Schrauder & 
Mehta (1993). Fuzzy logic control (which can be thought of as nonlinear heuristic control) 
has also been used in (Padiyar & KuLkami 1997) and shows superior performance under 
varying system conditions. Control of STATCON for reducing overcurrents during unbal¬ 
anced faults using PWM has been discussed by Jiang & Ekstrom (1995). In the design 
of control systems, the possibility of adverse interactions with the network and/or other 
FACTS controllers in the vicinity has to be kept in mind (Clark et al 1995; Woodford 
1996). The problem of network-controller interactions has been reported in the voltage 
control of STATCON (Padiyar & Kulkami 1997) which is solved by filtering the voltage 
signal. 


bus 


voltage 
reference 

capacitor ^ 
voltage 
reference 




lb us 
voltage 




capacitor 

voltage 



order 


Figure 9. Type I type controller. 
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Xnverter 
voltage 
angle 
order 

voltage 

Figure 10. Type II type controller. 

2 UPFC control 

tie UPFC allows three controllable parameters, viz., magnitude and phase of series in- 
cted voltage, and shunt reactive current. A control strategy for UPFC has been proposed 
’adiyar & Kulkami 1996) in which active power flow through the line is controlled, while 
gulating magnitudes of the voltages at its two ports. 

The controller for this purpose uses only local measurements. The series voltage of the 
PFC is split into two parts, one in phase with the line current (“real voltage”) and the 
her in quadrature (“reactive voltage”). Series reactive voltage is used to control active 
iwer flow in the line, while real voltage and shunt reactive current injection are used 
I regulate voltage at the two ports. A block diagram representation of the series voltage 
)ntroller is shown in figure 11. The power controller sets the reactive voltage reference, 
n auxiliary current angle stabilizing signal is used to damp a critical network mode 
isociated with series reactance of line (steady state contribution of this auxiliary signal is 
jro because of the washout block [^Ty;/(1 + ^Tio)]). A damping controller for the slow 
ectromechanical modes of the system associated with generator rotor swings modulates 
le power reference, and uses the generator slip signal. A synchronising component {K/s) 
present in the controller in addition to the damping component (D) to reduce angle 
aviations. The real voltage to be injected can be calculated so as to maintain port 2 
Dltage magnitude of the UPFC constant. The voltage and power references can be set 


■bus 


voltage 

reference 


bus 



reactive current 


Ci(.s) 

reference 

reactive 


current 

controller 



voltage 


Figure 11. UPFC series voltage controller structure. 
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according to steady state real and reactive power flow requirements. The shunt current 
controller is similar to the one shown in figure 9. 

4. Stability improvement using FACTS controllers 

The fast response of FACTS controllers can be used effectively to improve dynamic per¬ 
formance of the system. The dynamic problems associated with power systems can be 
characterised as (Padiyar 1996): 

(1) Angle instability: which includes small signal oscillatory instability (“dynamic in¬ 
stability”) and large disturbance instability leading to loss of synchronism amongst 
generators (“transient instability”). These problems are associated with generator rotor 
swings. 

(2) Voltage instability: which implies an uncontrolled collapse in voltage at load buses 
precipitated due to some disturbance and is primarily caused by dynamics of the load. 

(3) Subsynchronous resonance (SSR): This problem is caused by the adverse interaction of 
the electrical system with the generator-turbine mechanical system causing oscillatory 
instability or transient torques. This problem is usually present in radial series com¬ 
pensated lines connected to turbine generators. 

While the application of FACTS controllers for steady state active and reactive power 
flow control is relatively straightforward, their use for stability improvement poses the 
following questions: 

(1) Where should the controllers be located in the system for maximum effectiveness? 

(2) What should be the control laws? 

Usually FACTS controllers are used for steady state functions which is the main control 
objective, but supplementary controls are included for stability improvement. However, 
controllers such as Thyristor Controlled Brake (Rao & Nagsarkar 1984: Raschio et al 1995) 
and NGH-SSR damper (Hedin et al 1981; Hingorani 1981; Hingorani et al 1987; Benko 
et al 1987) are used specifically for transient stability improvement and SSR mitigation 
respectively. It should be kept in mind that the control strategy employed will vary from 
application to application depending on the relative severity of the various dynamic and 
steady state problems. 

For SVCs, while voltage control is the main function, Supplementary Modulation Con¬ 
troller (SMC) can be introduced for damping small signal oscillations. This controller 
modulates either the voltage reference of the voltage controller or the susceptance. A typ¬ 
ical configuration of SMC is shown in figure 12. A washout block is used to drive SMC 
output to zero in steady state. Similar control structures can be used for TCSC. 


p'^lter gain washout phase lead 



Figure 12. Supplementary modulation controller. 
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Methods proposed in literature for deciding location of FACTS controllers and conti*ol 
rategies (choice of control signal {u in the SMC) and the corresponding control law) are 
ased mainly on eigenvalue sensitivity/residue (of transfer function) approaches (Padiyar 
'^al 1986; Martins & Lima 1990; Okamato etal 1995). Computation of several indices has 
3en proposed (Larsen et al 1995) to select the appropriate control signal and control law. 
ontrol strategies have also been proposed based on energy function approach (Gronquist 
^al 1995). 

Study of large disturbance dynamics involves analysis of a large set of nonlinear dif- 
;rential equations. Design of control strategies and deciding on the best location from 
le point of view of transient stability improvement is a difficult problem and has to be 
ilidated using simulation studies. Large disturbance stability improvement with FACTS 
3ntrollers has been shown using simulation studies (see for example Mihaltic et al 1996). 
'iscrete control sti'ategies have been proposed by Kosterev & Kolodziej (1995) and Padiyar 
: Uma Rao (1997). 

The problem of torsional interactions (SSR) has been a major concern, primarily because 
>me FACTS controllers like SVC have the potential to cause adverse interations, even in 
le absence of series compensation (Rustomkolai etal 1990). Therefore thorough studies 
eed to be done in order that FACTS controllers themselves do not aggravate this problem, 
tudies on application of TCSC and field tests indicate that SSR problem can be reduced 
ith it (see Piwko et al 1996). It is possible (FACTS controllers can have time of response 
f the order of 1-3 cycles) that supplementary controls be introduced to damp SSR directly, 
se of auxiliary signals to damp SSR has been proposed for SVCs (see Ramey et a/1981, 
/ascjnczuk 1981, Putman & Ramey 1982, Hammad & El-Sadek 1984, Abi Samra et al 
985, Padiyar & Varma 1990). Further work needs to be carried out with new controllers 
ich as STATCON, UPFC and SSSC. 

. Conclusions 

he application of FACTS controllers throws up new challenges for power engineers, not 
nly in hardware implementation, but also in design of robust control systems, planning 
nd analysis. 

There has been considerable progress in the application of FACTS controllers. Notably, 
rototypes of TCSC and STATCON of sufficiently high ratings are now in operation. Imple- 
lentation of several other controllers such as UPFC are being planned. Field experiences 
dth these installations are eagerly awaited by power engineers. 

The new generation of FACTS controllers are voltage source converter-based and have 
2 veral advantages over thyristor-controlled variable impedance type controllers. However, 
lese controllers require switching devices with turn-off capability. At present, only GTO 
evices are available at the voltage and current ratings required. These devices have certain 
rawbacks and it is anticipated that in future, better and more efficient devices like IGBT 
nd MCT will be available in the required ratings. SpuiTed by demand, the cost of these 
evices is also expected to come down. 

There is every reason to believe that in a decade or so, FACTS controllers will rev- 
lutionize electrical power transmission systems making them more reliable, optimally 
tilized and better controlled. 
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Abstract. This paper attempts to present a comprehensive review of the ad¬ 
vances made in vector control or field orientation as applied to high performance 
ac motor drives. Brief application survey, machine models in d-q representa¬ 
tion, implementation issues with inverters and cycloconverters, parameter ef¬ 
fects etc for both induction and synchronous motor vector control are dealt 
with and sample results from studies on them are presented. The latest advance 
on this control like direct torque control (DTC) has been briefly discussed. A 
substantial updated bibliography, though by no means complete, is included 
for those who are interested in keeping track of the present state-of-the-art and 
working further in this area. 

Keywords. Vector control; field orientation; ac motor drives; high perfor¬ 
mance drives; induction motor; synchronous motor; direct torque control. 


1. Introduction 

Electric drives for motion control must have a fast torque response, four quadrant operation 
capability and controllability of torque and speed over a wide range of operating conditions. 
A separately excited dc motor, earlier used as the primary machine and later with simple 
power electronic controllers and current feedback, provides direct control of the magnitude 
of armature current and, in proportion, the torque, and has been the most popular choice for 
many industrial drives for such requirements in spite of its inherent drawback of the bulky, 
expensive and maintenance-prone commutator. On the other hand, ac motors, specially 
induction motors with their simple, less expensive, and more robust structures are more 
suitable for industrial environments though their control is quite complex. This is due to 
the fact that the rotor current in an induction motor which is responsible for the torque 
production owes its origin to the stator current which also contributes to the air-gap flux 
resulting in a coupling between the torque- and flux-producing mechanisms. In the dc 
machine, the field current in the stationary poles producing the magnetising flux and the 
armature current directly controlling the torque are independently accessible. Moreover, 
for a fully compensated dc motor, the spatial angle between the flux and the armature mmf 
is held at 90° with respect to each other, independent of the load, by the commutator and 


the brushes whereas in an ac motor (both induction and synchronous), the spatial angle 
between the rotating stator and rotor fields varies with the load and gives rise to oscillatory 
dynamic response. Control methods for ac motors that emulate the dc motor control by 
orienting the stator current so as to attain independent and ‘decoupled’ control of flux and 
torque are known as ‘field orientation’ control and require control of both the magnitude 
and phase oiac quantities and thus are referred to as ‘vector control methods’. 

Early conceptual works in vector control were by Blaschke (1972) and Hasse (1969), 
which were translated into practical implementation later by Gabriel et al (1980), Leon- 
hai'd (1985) and many others with the advances in microprocessors and microcomputers 
along with power electronics. Now, it has been established as a powerful technique in the 
field oi ac motor drives and adopted worldwide. An exhaustive list of publications has 
been reported in this topic, which includes an IEEE Tutorial Course (Novotny & Lipo 
1985) and two exclusive books (Vas 1990; Boldea & Nasar 1992). Work has continued un¬ 
abated in this field and several issues like simplification of practical system with advanced 
microprocessors, design of current regulators/flux observers, reliability enhancement, per¬ 
formance improvement, parameter adaptation etc. are still attracting the researchers in this 
field. This paper attempts to make a summary review of the progress in vector control as 
applied to both induction and synchronous motor drives highlighting some typical results 
from the drives developed by the author and his research students at the Indian Institute of 
Technology, Kharagpur. 


2. Vector control of induction motors 

2.1 Brief application survey 

The principle of vector control is used in current regulated PWM inverter (CRPWM), CSI, 
VSI, and cycloconverter-fed induction motor drives. The controlled current operation of 
the motor results in simpler implementation. The CRPWM inverter is common for high 
performance servo drives while CSI and cycloconverters are used for larger drives. High 
frequency PWM transistor inverters (10 kHz), developed around 1979, made it possible to 
use vector controllers in various kinds of industries including pinch roll drives of continuous 
casting plates, machine-tool drives and gear-less servo drives as reported by Kume & 
Iwakane (1987). The control method was applied to a large-scale paper mill (Tanaka et 
al 1983) with induction motors of 300-560 kW rating using CSI. Application of vector 
controlled induction motors for high performance servo drives has been brilliantly surveyed 
by Leonhard (1986). High horsepower vector controlled induction motor servo drive using 
adaptive rotor flux observer has been recently developed with improved steady state and 
dynamic response (Huang etall 994). The recent trend is to eliminate the speed and position 
sensors in high performance vector controlled induction motor drives (Okuyama et al 1990; 
Onishi et al 1994; Tajima et al 1995). Very L’gh power (MW) range cycloconverter-fed 
induction motors, with vector control for steel mill drive are mature drive systems in Japan 
(Sugi et al 1983; Saito et al 1987) and Germany (Timpe 1982; Hasse 1977). Siemens has 
recently announced optimised vector controlled SMOVERT master drives for elevator 
applications (Scheirling & Schonheir 1995) having many important features. 



2.2a Dynamic model: A dynamic model developed either with the concept of space 
phasors (Leonhard 1985; Murphy & Turnbull 1988) or d-q representations (Novotny & 
Lipo 1985; Bose 1986) may be utilised to develop the basic machine equations for imple¬ 
mentation of vector control. We like to use the latter for convenience and familiarity. The 
d-q axes model of an induction motor with reference axis rotating at synchronous speed 
We is 
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The field orientation implies that the stator current components obtained be oriented in 
phase (flux component) and in quadrature (torque component) to the flux vector which can 
be either stator flux {^fg), airgap or mutual or magnetising flux (V^m). or rotor flux (V^^) as 
shown in the equivalent circuit in figure 1 (Sathiakumar et al 1986). The orientation of the 
stator current with respect to the stator, rotor and airgap flux has been examined and the 
relative merits and developments of the schemes have been reported (Bayer & Blaschke 
1977; Sathiakumar et al 1986; Ho & Sen 1988; Erdman & Hoft 1990). It has been shown 
that the rotor flux orientation alone provides natural decoupling, fast torque response and 
all round stability. The stator flux and airgap flux orientation, however, are attractive due 
to ease of flux computation and for the purpose of wide range of field weakening operation 
(Xu & Novotny 1992) but need decoupler network (De Doncker & Novotny 1988). A new 
strategy called the ‘Universal field oriented controller’ has been developed by De Doncker 
& Novotny (1988) which decouples flux and torque in an arbitrary flux reference frame. 

Rewriting the rotor voltage equations in (1) 
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Figure 1. Conventional stator referred induction motor equivalent circuit showing different 
flux vectors. 

For rotor flux orientation control, the rotor flux axes are locked with the synchronously 
rotating reference system such that the rotor flux is entirely in the <i-axis, 


Substituting (7) & (8) in (5) & (6) yields 
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For the range of operation below the base speed, the flux -ij/f = is kept constant, when 

prdr = 0. ( 11 ) 

From (4) & (8), 
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which shows a direct equilibrium relation between the torque component current and 
the rotor current . The torque equation is 
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During flux changes in the transient, pfdr 0 and from (10) 


which shows the desired property of providing a torque proportional to the torque command ^ 
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Combining (14), (4) & (8) to eliminate yields the equation relating and (flux 
command and the flux), 

(K + L',p)rdr^RrLmi^ds^ (15) 

which in the steady state is 

tdr^Lmi^ds- ( 16 ) 

The close parallel to the machine is now clearly visible. With the flux command held 
constant, a change in is followed instantly by corresponding change in While with 
a change in flux command, a transient rotor current is induced which subsequently decays 
with the rotor open circuit time constant Lj./i?' as shown in (15). 


2.2b Steady state model: A convenient steady-state equivalent circuit model of the 
field oriented induction motor as shown in figure 2 can be obtained from the conventional 
equivalent circuit (figure 1) by using a referral ratio a = Lm/L'j. in lieu of the common 
choice of the stator to rotor turns ratio (Novotny & Lipo 1985). With adoption of this 
ratio, the stator current is seen to be subdivided into the orthogonal components (flux 
component) and Ist (torque component), equivalent to z^^ and referred in the dynamic 
model, and the slip relation (9) can be obtained by equating the voltages across the parallel 
branches as 
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Equation (17) expresses the co-ordination between the slip and the current components 
required to attain correct field orientation as relevant for indirect vector control discussed 
later. The torque expression is obtained from the airgap power as 
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which shows the desired torque control via current components Is-^ and Ist- 



Figure 2. Derived steady state equivalent circuit for rotor flux orientation scheme. 
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2.3 Induction motor vector control implementation 

The implementation of vector control requires information regarding the magnitude and 
the position of the flux vector (stator, rotor or mutual, as the case may be) and fast control of 
stator current in both magnitude and phase. Depending upon the method of flux acquisition, 
the vector control can be direct (Blaschke 1972) or indirect (Hasse 1969). The universal 
field oriented controller developed by De Doncker & Novotny (1988) is applicable to both 
these field orientation schemes and the generalised approach by Ogaswara ei < 3 / (1988) to 
indirect control of induction and synchronous motors. 


2.3a Direct field orientation: In the direct method, also known as flux feedback method, 
the airgap flux is directly measured with the help of sensors such as Hall probes, search 
coils or tapped stator windings (Zinger et al 1990) or estimated/observed from machine 
terminal variables such as stator voltage, current and speed (Jansen et al 1994). Since it is 
not possible to directly sense rotor flux, it is synthesised from the directly sensed airgap 
flux using the following equations 
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A variety of flux observers can be employed to estimate and improve the flux response 
with less sensitivity to machine parameters as detailed by Verghese & Sanders (1988) 
and Atkinson et al (1991). A major drawback with the direct orientation schemes is theii 
inherent problem at very low speeds when the machine JR drop dominates and the required 
integration of the signals to measure the airgap flux is difficult. Closed-loop stator flu:7< 
observers based on the motor current, voltage and the measured rotor position have beer 
found to obviate this difficulty (Jansen et al 1993; Lorenz et al 1994). 

A rotor flux observer based direct vector control scheme as implemented in the laborator ;5 
is shown in figure 3 (Chattopadhyay etal 1992; Thakur 1996) using a CRPWM inverter wi tl 
flux and torque regulating loops. The vector rotator block implements the transformatioi 
from rotating to stationary axes followed by a 2/3 phase transformation resulting in thi 
following expressions 
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The speed loop control provides the torque command whereas the flux command 
selected according to the operating requirements in either constant torque or constai 
horsepower region. For CRPWM inverter, line currents are controlled in such a way as 
follow the reference current commands generated from the vector rotator. 
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Figure 3. A rotor flux observer based direct vector control scheme for an induction motor 
with a CRPWM inverter. 


to employ the slip relation (9) to compute the flux position relative to the rotor by summing 
a sensed rotor position signal with a commanded slip position signal 

= + (22) 



Figure 4 illustrates the basic structure of an indirect field orientation scheme using a 
CRPWM inverter (Thakur et al 1993; Thakur 1996). The commanded currents and 
are converted to stator referred reference currents by rotating to stationary and 2/3 phase 
transformations as in the case of direct field orientation. is controlled according to the 
desired torque and constant rotor flux, ij* is obtained from (16) in the steady state. 

Indirect field orientation, also known as flux feed-forward control, does not have inherent 
low speed problems and is preferred in most systems which must have zero speed. However, 
the inherent limitation is in the slip calculation which depends on the commanded machine 
parameters that may differ from the actual values during running condition of the drive. 
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simpler and cost effective as the differential equations involved are readily solved in real ' J 
time. Beginning with an 8-bit microprocessor, a 16-bit or a 32-bit or now a DSP, a transputer ^ 

or a custom-made LSI chip has become a part of the vector control hardware (Gabriel et al 
1980; Sathiakumar etal 1986; Mingbao etal 1987; Wu & Strangas 1988; Asher & Sumner 
1990; Ho & Sen 1990; Xu & Novotny 1991; Kao & Lin 1992; Lakaparampil 1994). 

Multi-microprocessor configuration has also been used to implement sophisticated con¬ 
trol structure (Harashima et al 1985; Saito etal 1987; Tzou & Wu 1990). The limitation 
in microprocessor application due to its finite word length, execution time and the oper¬ 
ational instructions must be taken into account in designing a processor based system as 
they affect significantly the performance of the system (Dote 1988; Jelassi et al 1992). A 
faster operation may be obtained in a hybrid scheme (Thakur et al 1993; Thakur 1996) 
using both analog hardware and microprocessor based controller where tasks such as 2/ 3 
phase transformation and PWM switching signal generation are achieved with hardware 
and the PI controller/observer design and implementation by a microprocessor with a PC- 
XT, Digital computer simulation technique is preferred to optimise the effects of various 
factors before implementation. Few typical simulation results as obtained and experimen¬ 
tally verified by Thakur (1996) are shown in figure 5 for both indirect and direct vector 
control. It is seen as expected that the performance of the latter is somewhat superior. 

2.4 Ejfects of motor parameter variations and adaptation 

Both the schemes described used machine parameters either in the calculation for the slip 
command for implementing the indirect vector control or to synthesise the flux vector to 
implement the direct vector control. In the indirect control the main problem is the rotor 
circuit time constant Lj.//?' which is sensitive to both temperature and flux level (Nordiri et 
al 1985; Krishnan & Bharadwaj 1991; De Doncker 1994). Direct field orientation systems 
are sensitive to stator resistance and total leakage inductance but, typically, the parameter 
sensitivity is less here than that with the indirect control, specially because of the flux 
regulation through feedback. With deviation of parameters, the field orientation is not per¬ 
fect and the controller should track the machine parameters. Several methods of parameter 
adaptation have been attempted (Garces 1980; Matsuo & Lipo 1985; Dalai & Krishnan 
1987; Krishnan & Doran 1987; Nilsen & Kazmeirkowski 1989; Bal & Grant 1992; Ghosh 
& Bhadra 1992), along with a number of identification schemes including Model Refer¬ 
ence Adaptive control (Ohnishi etal 1986; Holtz & Thimm 1989; Vas 1990; Bal & Grant 
1992; Moriera & Lipo 1993). Automated initial tuning in the form of self-commissioning 
technologies has also been developed (Khambadkone & Holtz 1991; Lorenz et al 1994; 
Borgard et al 1995; Yanagawa et al 1995). Recent work on the on-line tuning to improve 
the robustness of vector control induction motor has been reported using special torque 
control strategy (Noguchi et al 1997; Tadakuma et al 1997) and feed forward/feedback 
control with neural network. A new flux and stator resistance identifier for ac drives has 
been proposed by Kerkman et al (1996). 

Two new approaches to induction motor field-orientation are presented in Matsuo 
et al (1994) which employ rotor end ring current phase detection to make the controller 
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Recently, fuzzy and neural network-based estimators of feedback signals such as ro¬ 
tor flux, unit vectors and torque for indirect and direct vector control schemes have been 
reported (Miki et al 1991; Sousa & Bose 1993; Simoes & Bose 1995). These have the 
advantages of faster execution speed, harmonic ripple immunity and fault tolerance char¬ 
acteristics compared to a DSP based estimation. 

2.7 Direct torque control (DTC) 

The latest control method developed and commercialised by ABB, Sweden from the con¬ 
cept of the field-oriented or vector control is direct torque control (DTC), a patented concept 
developed again in Germany by Depenbrock (1988). The basic control scheme is shown 
in figure 6 when both the flux and torque are controlled by a hysteresis controller (Tiiti- 
nen et al 1996; Nash 1997). The delays associated with the PWM stage are eliminated 
since the PWM modulation is replaced by an optimal switching (Space PWM) logic. The 
adaptive motor model estimates the actual torque, stator flux and shaft speed as well as the 
frequency. The flux and torque are calculated every 25 ps and the speed and the frequency 
once per millisecond. The input to the motor model includes the motor current for two sta¬ 
tor phases, line voltage and power switch positions. The optimal switching logic is realised 
by ASIC hardware (ACS 600). The switch information for the power module is utilised in 
the calculation of the appropriate voltage vector which will satisfy both the torque status 
and flux status outputs. This method results in a better torque response than the flux vector 
control and, in addition, assuming moderate speed accuracy is acceptable (typically 0.1— 
0.3%), the need for a pulse encoder is eliminated. Implementations of special functions 



Figure 6. Direct torque control (DTC) scheme. 
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flying start, flux braking, flux optimisation and powerloss ride through are all made 
er with this control approach, as claimed. 

Doubly-fed and multiphase induction motor control 

vector control of a doubly-fed slip ring induction motor in a Scherbius scheme as used 
igh power pump drives with a current controlled cycloconverter in the rotor side is 
)ly described by Vas (1990) and Bose (1986) for super/sub-synchronous speed control, 
same system can be used for VSCF generation systems, where the control strategy 
ains the same except that the active and the reactive currents of the cycloconverter are 
trolled to control real and reactive powers, respectively, at the stator terminals by the 
[back method. A novel control strategy to realise torque and reactive power control of 
)ubly excited induction machine with position sensorless scheme using rotor voltage 
currents as feedback signals has been proposed recently by Xu & Cheng (1995). 

L strategy for improvement of the reliability for vector-controlled induction motor drive 
1 a modified topology where the neutral point is returned to the midpoint on the dc link 
roposed by Liu et al (1993). This allows for continuous disturbance-free operation of 
drive even with complete loss of one leg of the inverter or motor phase. This method 
been extended to field-oriented control for a multiphase induction machine with an 
alanced stator winding structure (Zhao & Lipo 1996) 

Vector control of synchronous motors 

Brief application survey 

ile vector-controlled induction motor drives have been used mostly in the industry for 
hum power ranges, vector-controlled synchronous motor drives are either in the very 
1 power range (1-10 MW) wi th wound-field machines fed from cycloconverters or in the 
kilowatt range with permanent magnet synchronous motors (PMSM) or synchronous 
ctance motors for servo drives. The control of synchronous motors is different from 
of induction motors primarily due to the fact that in the former, the magnetising current 
be supplied from the field side independently of the armature current and the space 
ition of the field is located by the position of the rotor. Additionally, the steady state 
between the rotor (which usually carries the field winding) and the controlled flux 
tor vanishes in the steady state. Therefore, the indirect or the feed-forward type of 
tor control as used extensively for the induction motor drives does not apparently seem 
ious for a synchronous machine. The ‘transvector control’ as applied to a synchronous 
;or by Bayer et al (1972) is essentially a direct type of flux feedback control where the 
or current is orthogonally oriented with respect to the stator flux vector to achieve unity 
idy state power factor. The decoupling is achieved by a closed loop flux feedback in 
ition to feeding a part of the magnetising current from the stator during the transient 
:ompensate for a sluggish field current change. Siemens has reported (Timpe 1982; 
imann 1992) the development of vector controlled cycloconverter-fed subsynchronous 
tor drives for use in reversing rolling mills to achieve high dynamical control response, 
•wn Boveri reported the development of the first gearless tube mill (Blauenstein 1970; 
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Stemmier 1970) using flux feed-forward control scheme. Terens et al (1982) used both 
static and dynamic flux models to control a similar drive. Nakano et al (1984) reported the 
development of a high performance synchronous motor drive for a rolling mill, with an 
open-loop flux estimator and PI current controller. An airgap flux oriented vector controlled 
cycloconverter drive was developed by Hill et al (1987) for an icebreaker. A very good 
survey of field-oriented control of synchronous machines including various applications 
has been made by Novotny & Jansen (1991) with a discussion on the difference between the 
‘space angle control’ (SAC) relevant to self-synchronous commutatorless motor (CLM) 
and the true field-oriented (FO) or vector control. While in the former the angle of the 
armature current vector with respect to the field axis may be other than 90°, in the latter 
it is strictly restricted to 90°. Earlier, high power drives using CSI converters and wound 
field-synchronous motors with load commutation for fan and compressor drives were 
SAC systems utilising a rotor position detector to cause the power converter to supply 
stator excitation in synchronism with the induced voltage from the field excitation. PMSM 
motors operated in tme FO system as used for servo drives and machine tools are reported 
by Kaufman et al (1982) and Wescheta (1983). 


3.2 Synchronous machine model and vector control implementation 

3.2a Wound field synchronous motor model: The d-q model of a wound field salient 
pole synchronous machine with damper windings in Park (rotor) reference frame is 
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The field circuit parameters Vjr^, i'^^, refer to the stator. For steady state, p- 

terms vanish. 
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3.2b Vector control and angle control: The rotor position feedback and vector control 
of the motor stator current to maintain the space angle between the field winding and 
the stator ramf results in stator currents that translate to set values of iqs and ids in the 
rotor reference frame. This is due to the instantanenous control of the phase of the stator 
current to always maintain the same orientation of the stator mmf vector with respect to 
the field winding in the d-oxis of the d-q model. The resulting axes current are shown in 




I 

(a) d-axis (b) 


Figure 7. Phaser diagram of synchronous machine currents in d-q axes; (a) space angle 
control, (b) field orientation (y = 0, Ids — 0)- 


figures 7a and b (Novotny & Lipo 1985) for space angle control and the field orientation 
(when y = 0, Ids = 0). Note that for field orientation the field current in the <i-axis and 
the stator current in the ^-axis are 90^^ apart. 



3.2c Implementation with CSI and CRPWM inverter: The implementation calls for 
control of magnitude and phase of the stator current with respect to the location of the 
field winding axis. Figure 8 shows a direct implementation (y = 0) using absolute rotor 
position sensing and a CSI. With y = 0, the stator current is entirely ^-axis current and is 
equivalent to a torque command. The y * command is entered in the ‘phase regulator’ block 
and the drive can be operated at other than y* = 0. Figure 9 shows a simple means for 
implementing torque control with independent ij'-axis and J-axis currents using a CRPWM. 
The absolute rotor position information is used to convert the and commands in the 
rotor reference frame to a stator reference frame - which become the current commands 
for the CRPWM. Normal field orientation is obtained by setting = 0. Varying z^* 
provides control of power factor and other varying performances. The ‘rotor to stator 
transformation’ block in figure 9 implements the same equations as in (21). 

3.2d Implementation with cycloconverter: Cycloconverter-fed synchronous motors 
have been preferred for low speed large power drives e.g. mine hoist winders, gearless 



Figure 8. A synchronous motor vector control scheme using a CSI (y = 0, = 0). 



























lent mill drives, rolling mill drives, ship propulsion drives etc. Synchronous motors 
'e been preferred in these drives rather than induction motors because of their power 
tor and large torque capability at low speeds. Furthermore, a naturally commutated 
doconverter, compared to an inverter, provides a near-sinusoidal current excitation re¬ 
ting in negligible torque ripple, inherent four quadrant capability, robustness and large 
ver handling capability. 

^ stator flux-oriented vector control scheme which is an improvement of that in Bayer 
il (1972) and Nakano et al (1984) for a 6 -pulse non-circulating current cycloconverter- 
synchronous drive with a flux observer has been developed recently by Das (1996) 
I Das & Chattopadhyay (1997) for a rolling mill drive. Figure 10 shows the implemen- 
on scheme which aims at a control that maintains a spatial orthogonality between the 
sc vector -^s and the armature current vector ia as shown in the space phasor diagram 
figure 11. The reference speed and reference flux commands are given to the vector 
itroller that generates the reference analog voltages for the cycloconverter (through the 
Tent controller) and the field converter. The stator flux is estimated by a closed-loop 
luced order observer. Referring to figure 10, Ci is the speed controller that generates 
torque command which is divided by the stator flux to generate the torque command 
current igT ■ The magnetisation current along the flux axis (i'^) is obtained from a flux 
Itroller C 2 . The transient stator flux component of current ism is obtained from the re- 
Lonship, — i/d cos S, which decays down to zero in the steady state. The steady 

te displacement angle is decided by the displacement angle controller. The set value 
the field current is obtained from the relation, ~ cos 5. C 3 is the field current 
Itroller that generates the control voltage for triggering the field converter. The vector 
ator (VR) transforms the vector from two axes flux - torque reference frame to abc 
tionary reference frame. The observer and the control circuit design aspects together 




Figure 11. Space phasor diagram for vector controlled synchronous motor. 
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Figure 12. Speed and current responses to a step reversal (200 to —200 rpm) of a vector 
controlled cycloconverter fed synchronous motor drive: (a) simulation results, (b) experi¬ 
mental results. 

with the PC-based implementation are detailed in Das (1996) and Das & Chattopadhyay 
(1997) and typical results obtained are shown in figure 12. 

Vector control of a synchronous motor can be made with respect to three flux Jmm f vec¬ 
tors, namely, the stator flux, the damper flux and the field mm/. A unified analysis of these 
three schemes by Das (1996) shows that the stator flux orientation results in a unity power 
factor which is not the case with the other schemes. A damper flux orientation scheme which 
is comparable to rotor flux orientation in the induction motor drive has been recently re¬ 
ported by Chongjium et al (1995) without any detailed analysis. Orientation with field mmf 
results in a lagging motor terminal power factor and is not expedient for high power drives. 

3.2e Saturation and damper effects: Magnetic saturation effects of the ^^-axis and q- 
axis for a damperless, salient pole stator flux-oriented wound rotor synchronous motor 
drive have been studied by Brass & Mecrow (1992) by developing a saturated flux model. 
In a separate paper (Brass & Mecrow 1993), the effect of damper windings on field-oriented. ^ 
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Figure 13. Vector control scheme for a permanent magnet synchronous motor. 

synchronous motor has been studied and it is shown that the presence of damper windings 
improves the torque dynamics. 

3,2f Vector control of permanent magnet synchronous and reluctance motors: Perma¬ 
nent magnet machines, both surface-mounted and interior-magnet types, are extensively 
used in servo drives and robotic applications, and vector control provides smooth torque 
operation of these motors through the entire speed range, including zero, with high power 
factor. The control schemes for these machines have been extensively discussed in Vas 
(1990) and so are not detailed here. In a permanent magnet synchronous motor, the ro¬ 
tor field flux irf and the corresponding equivalent field current If can be considered as 
constant. For surface-mounted machines, the saliency and armature reaction is negligible. 
Therefore, and for maximum torque sensitivity with stator current /J* = 0, and 

1* = Figure 13 shows the vector control principle for the PMSM derived from the 
induction motor control diagram with the modifications (Bose 1986), Wsi =0, 9j. =9g. 
A microprocessor-based field-oriented control scheme for a permanent magnet hysteresis 
synchronous motor is presented in Qian & Rahaman (1993). 

Synchronous reluctance motor drives have recently received renewed attention due to 
the application of field oriented control to these motors (Boldea et al 1991; Xu et al 
1991; Matsuo & Lipo 1993). Excellent control perfonnance of the drive systems has been 
obtained though there exists a limitation in the field weakening range. 

4. Conclusions 

The vector control of uc drives in which there have been a spurt of activities, has .by 
now gained maturity but still continues to provide interesting and challenging scope for 
innovations to researchers and application engineers. This paper has made an attempt to 
make a summary review of the activities on various aspects in this important field for control 
of both induction and synchronous machines till date with the informations available in 
the published literature. It is expected that the review will help those interested in the 
development of efficient and high performance drives of the future. 
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Abstract. The objective of this paper is to review the state-of-the-art and 
recent developments in Switched Reluctance Motor (SRM) drives. The interest 
for improved performance and reliability has motivated many SRM advances 
in the recent years. Even after almost 30 years of research in SRM, which 
might appear to be the simplest of all machines, there remain critical issues to 
be explored to gain deeper insight into the SRM technology. The paper briefly 
discusses the historical background and the basic operating principles of the 
motor. The topics discussed include the current state of research in converter 
topologies, control algorithms, torque ripple, noise, and sensorless operation. 
Recent advances in the field of SRMs indicates that they will have an increasing 
influence in the area of variable speed drives in the coming decades. 

Keywords. Switched reluctance motors; converter topologies; control 
algoridims. 


1. Introduction 

Switched Reluctance Motor drives (SRMs) are relatively new entrants in the rapidly devel¬ 
oping variable speed drives market. They are inherently variable speed drives which have 
simple construction, wide speed ranges, good energy efficiencies, high torque to inertia 
ratios, and high torque to power density ratios. The simple structure of SRMs will likely 
make them less expensive than the equivalent variable speed drives in mass production. An 
SRM has the flexibility of operating as a four-quadrant drive with independent control of 
speed and torque over a wide speed range. Their wide torque and speed range eliminates 
the need for expensive and troublesome mechanical gears and transmissions. Figure 1 
shows a selection of SRMs with power ranging from 100 W to 75 kW and speeds ranging 
of 250 to 30000 rpm (Lawrenson 1992). 

1.1 Historical background 

SRM is the modem version of the ‘electromagnetic engine’ which dates back to the late 
1830’s. The modem era of developmental SRM started in 1972 when it was patented by 
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Figure 1. Assortment of SRMs. 


Bedford. SRMs received considerable attention after the exemplary work done at the Uni¬ 
versities of Leeds and Nottingham in the 1980s (Ray & Davis 1979; Lawrenson et at 1980; 

Davis et al 1981). This spurred a series of research efforts all over the world, especially in 
Europe and in die US, resulting in several publications, patents and applications (Lawren¬ 
son 1992) which is illustrated in figure 2. Even after almost thirty years of research in 
SRM, which appears to be the simplest of all machines, there remain some critical issues 
which need further study. 

1.2 Basic principles of operation 

SRMs are structurally similar to Variable Reluctance Stepper motors, but they differ in 
the following aspects: 

• stator phase currents are switched based on the rotor position feedback; 

• the machine is designed to operate efficiently for a wide range of speed. 

SRM has salient poles on both stator and rotor with concentrated windings only on the 
stator and no windings on the rotor. Windings on the diametrically opposite stator poles ^ 
are connected in series. Figure 3 shows a typical 8/6 SRM. Currents in the stator windings 
are switched on and off in accordance with the rotor position feedback. 

The basic principle of operation of SRM is like any other reluctance motor — torque is 
produced by the tendency of the rotor to align itself to the minimum reluctance position. As 
this is independent of the direction of the current in the stator windings, the power converter 
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80^81 82/83 8/./85 86/87 88/89 90/91 92/93 

Figure 2. Approximate number of publications, patents and applications. 
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circuit can be simplified. The radial magnetic attraction that operates the SRM is about ten 
times larger than the circumferential forces that operate an induction motor. The torque- 
speed characteristics of a SRM replicates a conventional DC machine characteristics under 
fixed firing switching strategy. Typical torque speed characteristics and variation in the flux 
hnkage with current at different rotor positions are shown in figures 4 and 5. 

2. Advances in converter design 

SRMs cannot operate directly with a dc supply or the standard sinusoidal ac supply 
available ‘off the wall’. Hence it is important that the design of the converter should be 



i 

Figure 5, Flux'/current/position curves. 
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coordinated concurrently with the design of the motor to obtain an optimal design of the 
drive as a whole. Unlike the motors that operate with sinusoidal voltages and currents, the 
converter topology in an SRM is dependent on the machine design. The topology depends 
on the motor configuration, the number of stator and rotor poles and the associated con¬ 
duction and overlap angle (Vukosavic & Stefanovic 1991). In general, the starting torque 
requirements decides the current rating and the maximum speed of the motor decides the 
voltage rating of the inverter. 

An ideal converter must satisfy 

• low switches per phase ratio, 

• ability to supply and control a commanded current independently and precisely, 

• flexibility in adapting to any number of phases (odd or even), 

• low VA rating for a given rating of the drive, 

• robustness and reliability, 

• good efficiency, 

• ability to operate in all four quadrants effectively, 

• less torque ripple and noise. 

The most commonly used converter types are the classic half-bridge type converters and 
the split phase type converters. The classic type converter is the most flexible type converter 
but it requires a greater number of switches. The split phase type converter requires an 
even number of phases and has high active device rating, and hence is suitable only for low 
voltage and low power applications. Another type of converter suitable for star connected 
type SRM is the dual-decay type converter with flexibility of control in freewheeling mode 
and reduced device ratings (Ehsani et al 1993). Most of the converters developed recently 
are aimed at reducing the number of switches and are found to be more application-specific 
with some trade-offs. The different converter topologies are shown in figure 6. 

3. Advanced control schemes 

A reliable SRM drive system must have the following characteristics 

• parameter insensitive control characteristics 

• quick precise dynamic response with no overshoot 

• rapid recovery from transient disturbances 

Conventional linear controllers are quite sensitive to plant parameters. This along with 
the need for optimum performance of SRM like maximum efficiency, maximum torque 
and minimum torque ripple calls for sophisticated control strategy. The important control 
variables in an SRM, switch-on and switch-off angles, are a nonlinear complex function of 
many motor parameters and thereby require fine tuning (sophisticated control) for optimal 




V 



V 



V 


c) 


d) 


e) 

Figure 6. SRM converter topologies, (a) 
Bridge, (b) bifilar, (c) split capacitor, (d) 
Miller, and (e) C-dump circuits. 

surface in the state space. These methods are shown to provide a better torque ripple 
charactenstics and are insensitive to parameter variations and disturbances. Artificial neural 
networks (ANNs) have been used successfully in the control of nonlinear dynamic systems 
(Reay et al 199?>). The capability to accommodate accurate nonlinear modeling has made 
Artificial Neural Networks (ANNs) an ideal candidate to solve the control strategies of 
1 erently nonlinear SRMs. Fuzzy logic controllers are gaining interest recently in the field 
of nonlinear control (Bolognani & Zigfiatto 1993). They offer the following advantages - 
ey 0 not require an accurate model of the plant, they can be designed on the basis of 
linguistic information obtained from the previous knowledge of the control of the machine 
and give better performance results than the conventional controllers. Amor et al (1992) 
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suggested an adaptive control based on feedback linearization for less torque ripple (only 
position control) later used fifth order model (Amor et al 1993) to control speed and 
torque for adaptive control. A machine independent method control to minimize energy 
consumption is discussed by Kjaer et al (1994). 

3.1 Need for seif-tuning control 

The following are the assumptions (Ehsani & Ramani 1993) made in most of the conven¬ 
tional control methods: 

• the inductance is symmetrical about the aligned position, 

• all the rotor and stator poles have perfect symmetrical tooth structures, 

• the inductance variation is time and temperature independent, 

• saturation has no or very less effect on the inductance variation, 

• the inductance profile is identical for all the phases, 

• the inductance profile is identical for all the machines of the same rating in mass pro¬ 
duction. 

In practice the above assumptions are not valid and these factors affect the sensitiv¬ 
ity of a control algorithm. Control methods that can adapt to the parameter drifts are 
required to optimize the machine performance. Recently the problem of obtaining opti¬ 
mal performance from an SRM in the presence of parameter variation has gained con¬ 
siderable interest (Tandon et al 1996). New self-tuning algorithms which optimize the 
steady-state performance of the drive as measured by Torque per Ampere (TPA) have 
been introduced. It has been realized only recently that due to parameter variation and 
drift, the phase inductance profiles can significantly differ from the design data (Ehsani 
& Ramani 1993; Ehsani et al 1993). Hence, it becomes necessary to use a controller with 
self-tuning capability if optimal performance of the SRM drive is to be maintained. Max¬ 
imum TPA is desirable for any drive application because the motor may be described as 
a ‘current to torque transducer’. Specifically, the following problems arise in practical 
SRMs. 

• Due to manufacturing tolerances the inductance profile varies by as much as 10% from 
phase to phase of the motor and also from motor to motor with the same design and 
rating. It is noted that the minimum inductance does not show any significant variation. 
This is due to the very large air-gap at the unaligned position. However, the maximum 
inductance occurs at the aligned position where the air gap may be less than 1mm. 
Therefore any minor variations in the air-gap show up as change in the maximum 
inductance for each phase. Thus, the slope of the inductance profiles will also differ and 
the torque production is affected. 

• With time, there will be wear on the bearings, as a result of which the air-gap may change 
or acquire a small eccentricity. Again, this has an impact on the maximum inductance 
of each phase. 




Figure 7. Control block diagram with self-tuning algorithm. 


Hence, the optimal values of the tum-on angle calculated off-line are sufficient for the 
TPA maximization. An on-line, self-tuning algorithm to determine the optimal value of the 
turn-off angle in the presence of parameter variations which alter the inductance profiles 
has proven to produce superior steady state performance. Figure 7 shows the block diagram 
of the above mentioned self-tuning control. The inherent simplicity of this new approach 
makes it ideal for real-time implementation in a digital control system. The control scheme 
is applicable to any SRM drive operated with a shaft position sensor, and does not depend 
on the number of phases, poles or HP of the motor. 


4. Advances in sensorless operation 

Closed loop operation of an SRM requires rotor position information for satisfactory 
performance. Conventional methods used for position sensing include resolvers, inductive 
or Hall effect sensors and optical encoders. These have the disadvantages like additional 
cost, additional electrical connections, mechanical alignment problem, less suitability to 
space restricted application and the significant disadvantage of being a potential source 
of unreliability. These lead to the research in sensorless operation of SRMs, resulting 
in several techmques in the past two decades. Most of the existing methods extract the 
rotor position information from the measurable electrical parameters. These techniques 
elinunate the requirement of the conventional position sensors, thereby increasing the 
reliability of the motor drive system considerably. They can be classified as below. 


(1) Active probing methods: These methods utilize the responses of diagnostic signals 
injected into the passive (un-energized) phase of an SRM. These methods are suitable for 
low speeds as the time window for passive phase measurement reduces at high speeds. 
Usually, the phase to be energized next is diagnosed for position estimation. They can be 
further classified as (a) Linear relation methods, and (b) inverse relation methods 










(a) Linear relation methods - In these methods, the signal containing position information 
is directly proportional to the phase inductance. In a typical method the rate of change of 
phase current, which is influenced by the incremental inductance, is monitored (Acamley 
et al 1985). Rotor position can now be deduced as the incremental inductance is a function 
of rotor position. This has an advantage of deducing the rotor position even at zero speed. 
Another method which is robust to switching noise was presented by Ehsani and coworkers 
(Ehsani & Ramani 1994; Ehsani et al 1994) and called the PM encoder technique. A 
sinusoidal carrier voltage signal of frequency much higher than that of the frequency 
of variation of inductance is chosen. Thus, the transient variation of the current phase 
will contain the information about the dynamic motor winding inductance. This encoded 
inductance information is decoded using zero crossing detectors for the voltage and current. 
The demodulator generates a square wave signal whose pulse width variation represents 
the phase inductance variation. 

Also a modified PM encoder technique is presented which is suitable for a wide range 
of speeds. Mathematical analysis and simulation results show that PM technique is more 
sensitive for lower values of inductance and AM technique is more sensitive for higher 
inductance values. To achieve a better sensitivity, a level crossing detector is used instead 
of a zero crossing detector. The level crossing detector is set to a threshold value. Now the 
square wave output corresponds to the phase angle variation with respect to the threshold 
value other than zero which gives better sensitivity than the above PM method. 

(b) Inverse relation methods - In these methods, the position information encoded signals 
are inversely proportional to the phase inductance. At high speeds, the motional EMF 
is very high, the current will never reach the rated value resulting in a ‘single pulse’ 
mode. In this mode, the current gradient in the next phase to be excited has the position 
information (Acamley et aM985). At the tum-on position, as the resistive voltage drop 
and the motional EMF are negligible, the initial rate of change of current is inversely 
proportional to the incremental inductance which gives the position information directly. 
The sensorless operation is implemented by comparing the initial current gradient with 
a optimal current gradient. In another method called the Amplitude Modulation (AM) 
method (Ehsani et al 1990; Ehsani 1991), the position information can be obtained from 
the amplitude of the current as it is directly proportional to the inductance variation. In 
this method, the envelope of the modulated current signal is detected. In addition, the 
information can be decoded by measuring the amplitude in terms of angles using a level 
crossing detector. 


(2) Non-intrusive methods: In these methods, the rotor position is obtained based on 
the measurable parameters without using any diagnostic or probing signals. Neglecting 
R, at low speeds the incremental inductance I = dV^/di is a function of 6 for constant 
i. Therefore, ‘d?’, rise time or fall time, can be used to obtain the position information. 
The flux linkage curve information on a multi-dimensional look-up table can be used 
to determine rotor position (Hedland 1986,1992). In a more recent method called active 
phase vector method (Ehsani et al 1994), a composite vector which is directly proportional 
to the inductance is obtained based on the discrete form of voltage equation for different 


(3) Open loop or synchronous control method: These methods are based on synchronous 
control and do not actually provide any position information. The motor is run from a 
variable frequency oscillator and change is made only to the dwell angle to improve the 
stability (Miller et al 1985). 


(4) Other methods: Some other methods are as below. 

(a) Observer based - Observer method reconstructs the state of the SRM drive system on 
the basis of known system inputs and system measurements (Lumsdaine & Lang 1990). 
Measurements of input voltages and currents were utilized. An accurate mathematical 
model including mechanical load (in state space form) to estimate current, flux linkage, 
speed and rotor position which was compared with actual current and error adjustment 
made using an adjustment matrix to estimate the position is used. 

(b) Mutually induced voltage based methods-In this method, the mutually induced voltage 
in an un-energized phase due to current in energized phase is monitored to obtain the 
position information (Hussain & Ehsani 1994). 

(c) Design based method - This method is based on altering slightly the structure of 
at least one of stator and/or rotor pole faces which will introduce a perturbation in the 
inductance profile of the motor while it is miming (Bartos et al 1993). The perturbations 
can be produced by introducing a notch or bump in the stator and/or rotor pole faces. The 
frequency of these perturbations gives direct information on the speed of the motor. 

The corruption' of position information due to the secondary effects of the existing sen¬ 
sorless methods must be considered to improve accuracy. It is found that a trade-off exists 
between extensive computation and good resolution in sensorless position sensing. There 
is still much room for improvement as there are no accurate, commercially applicable 
methods available. The need for inexpensive, rehable, indirect position sensing technique 
for a wide range of speed, still exists. The advances made in the field of power electronics, 
motion control and signal processing can be used to improve the commercial applicability 
of the existing methods. The existing computationally intensive, high resolution sensorless 
techniques can be made commercially viable in the future with the advances made in the 
computational power of Digital Signal Processors. Further development resulting in com¬ 
mercially applicable, inexpensive techniques is expected. More research will be necessary 
for a method with good positional accuracy and suitable for commercial application. 


5. Overview of critical issues 

Apart from the numerous advantages, SRMs are also known for their high torque pulsations, 
high acoustic noise and reliability issues due to sensor based operation. 
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5.1 Torque ripple 

The nonlinear coupling between the rotor position, phase current and overlap angle and the 
doubly salient geometric structure of the SRM are the intrinsic causes of torque ripple in a 
SRM. Torque ripple is very undesirable in low speed and servo type applications. Several 
methods have been developed to reduce torque ripple based on machine design or control 
strategies. 


5.1a Design based methods: Torque ripple can be minimized by suitably designing the 
magnetic structure of the machine (Tormey & Torrey 1991). Comprehensive procedures, 
beginning with the fundamental selection of pole numbers and geometry, for designing 
SRM drives for low torque ripple applications has been presented in the literature (Wallace 
& Taylor 1990). Values for specific torque are used to estimate the required SRM size and 
are obtained from empirical data or from an analytical estimation method. Pole numbers 
are chosen based on the speed and torque-ripple specifications for the design. The pole 
numbers define a range of feasible pole arc combinations. The centre pole arc values are 
chosen as a starting point. The current density in the phase winding is also chosen based 
on the thermal constraint of the application. The pole arcs and motor dimensions which 
yield minimum ripple are selected as the candidate design which is further evaluated using 
the dynamic SRM model. 


5.1b Control based methods: Classical linear controllers cannot eliminate the torque 
pulsations. The fundamental approach is by optimal current profiling that reduces torque 
pulsations. Torque ripple can be reduced by using a current-tracking control method in 
which the desired stator currents are computed by linearizing and decoupling transforma¬ 
tion. The shape of static torque-angle-cument characteristics of SRM drive can be fully 
determined by a series of measurements performed with the drive in a self-learning mode 
(Kavanagh et al 1991). Based on this, the current required to obtain the optimum torque 
contribution from each phase, at each rotor position, can be determined for a smooth torque 
performance. A single input, linear, decoupled output torque controller based on optimal 
precalculation of the phase current profile provides low torque ripple (Schramm et al 
1992). A bi-cubic spline interpolation was used to model the nonlinear experimental data. 
This method optimizes the current overlap at all torque levels so as to minimize the peak 
phase current. This current profiling algorithm results in the highest possible operating 
speed range under constant torque operation. The torque output is decoupled single-input 
linear function of torque input demand. Neural techniques can learn the current profiles 
required to minimize torque ripple and to satisfy other performance criteria on-line (Reiay 
et al 1993). Torque measurement is required to train the neural network. PWM current 
control can be used for smooth operation of an SRM drive (Hussain & Ehsani 1994). The 
torque pulsations during commutation are minimized by a current control strategy which 
allows simultaneous conduction of two positive torque producing phases over an extended 
predefined region. The effects of saturation can also be taken into account. 






Acoustic noise levels in SRMs are relatively high when compared to other ac drives. The 
acoustic noise has both magnetic and mechanical origin (Cameron 1992) . The possible 
sources include radial attractive forces between the rotor and stator, stator vibrations in¬ 
duced by the torque ripple, stator winding vibrations induced by the interaction of the stator 
current and the local magnetic field, magnetostrictive forces in the stator laminations, un¬ 
balanced magnetic and mechanical forces on the rotor due to manufacturing asymmetry 
windage and bearing vibrations. Of these, the dominant one is shown to be the radial vi¬ 
brations of the stator. Solving the noise problem can be approached either from the motor 
design point of view or the control point of view or a combination of these two. Acoustic 
noise can be reduced to any required level by current shaping or by introducing dither into 
the tum-on and turn-off angles. In another technique a chopper is introduced between rec¬ 
tifier and converter to reduce the phase voltage with respect to the speed (Pillay et al 1994). 
The voltage smoothing method reduces the rate of change of radial force and produces a 
smaller vibration (Pollock & Wu 1995). A three-stage commutation technique has been 
described in the literature which cancels the stator vibrations when the power converter 
does not have a zero volt loop by employing the three stage commutation technique at the 
beginning and end of the zero volt loop. The active cancellation methods like the three stage 
commutation technique are superior ,to the voltage smoothing method since they allow the 
energy to be dissipated in subsequent vibrations which can completely oppose each other. 
Research in this area is still in its infancy and further developments are expected in the 
near future. 

6. Current state of research 

6.1 SR generators 

Fukao (1986) discussed the principles and output characteristics of a super high speed 
reluctance generator system. Cameron etal (1992) discussed the computer-aided design of 
a VR generator and later about the control aspects of a high speed VR generator (Cameron 
& Lang 1992) system, Radun (1994) discussed the analysis and design of an aircraft 
engine starter/ generator. He proposed different excitation systems including one for three 
phase generation. Torrey & Hassanin (1995) present the design methodology for low-speed 
VRGs for renewable energy systems. 

6.2 Fault tolerance 

SRMs inherently have a good fault tolerance capability in the sense that they can operate 
with faulted motor windings or inverter circuitry. The magnetic independence of the motor 
phases and the circuit independence of the SRM converter gives SRM that capability. 
Stephens (1991) discussed the different types of faults that can occur and suggested a 
method to sense and isolate the faulted winding. Miller (1995) analysed SRM under faulted 
condition using FEM, magnetic modeling, actual experiments as well as a simulation 
software. 
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6.3 Losses and temperature rise 


Core losses in SRM forms a significant part of the total losses. Materu & Krishnan (1988) 
presented a method of calculating the iron losses based on the flux density waveform 
including saturation and presented a computationally intensive procedure using FEM time¬ 
stepping. Metawally et al (1988) discussed the core loss in SRM based on experimental 
results. Faiz & Dadgari (1991) analysed the heat distribution and thermal calculations 
for multitooth SRM under natural and forced cooling condition and compared it with 
experimental results. Corda & Olaca (1993) predicted (analysed) losses in the converter 
of SR drive under different modes of operation. The prediction of converter losses, for 
a given converter topology and method of control, is useful when selecting the type of 
power semiconductor switching components and design parameters of control with respect 
to efficiency of the drive. The prediction of converter losses is also useful for the correct 
thermal design. A new approach to calculate core losses was developed by Hayashi & 
Miller (1994) in matrix form using the Steinmetz equation. 

7. Applications 


SRMs have been successfully applied to a variety of applications resulting in high per¬ 
formance drives. Figure 8 shows typical efficiency vs. speed for curves for various 30 HP 
motors. The first successful application was as general purpose industrial drives. More 
recently, they have been commercially applied to automotive utility motors. Good fault 
tolerance and ability to operate in harsh environments made SRMs successful candidates 
as coal-shearing machines, textile spinning drives, friction welding machines, food pro¬ 
cessor applications, plotters, aerospace and automotive applications. SRMs of 5 MW at 
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50rpm to lOkW at 100,000 rpm have been built and tested successfully. Speed range of a 
SRM can be from 100; 1 to 1000:1. 

The fault-tolerance capability of SRMs is extremely good which makes it suitable for 
aerospace, automotive and industrial applications. The independence of each phase wind¬ 
ings and the absence of shoot-through paths contributes to the fault tolerance of the SRMs. 
Typical applications include traction, domestic appliances, mining, servo type and battery- 
powered applications. The application areas of SRM are rapidly expanding as the SRMs 
can compete virtually in any industrial or domestic drive market. 

8. Conclusions 

The state-of-the-art SRM drive technology and the recent advances made in the research 
of SRM were discussed. A wide range of topics including converter topologies, control 
algorithms, sensorless operation, fault tolerance, noise and vibration issues were presented. 
From the above discussions it can be clearly seen that the SRM drive technology has come 
a long way. Further research is necessary in some critical ai*eas for the SRM to be made 
commercially acceptable as a variable speed drive. Recent advances in power electronics 
and control systems are set to make SRM a commercially acceptable drive in the immediate 
future. 


The author gratefully acknowledges the significant contribution of Mr. Anandan 
Velayutham Rajarathnam in this work. 
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Abstract. This paper deals with the latest developments in Permanent Magnet 
Brushless DC (PMBLDC) motor drives. A comprehensive account of the state- 
of-the-art on types of construction of the motor, closed loop controllers in 
position, speed and current/torque control and recent trends in inverters, sensors 
etc. are given. Techniques for mechanical sensors elimination are discussed in 
detail. Special efforts made to reduce torque ripples, noise and vibrations are 
described. The impact of microelectronics through integrated chips used in the 
control of PMBLDC motor drives is given. The increasing applications of this 
drive due to improved performance and its cost reduction are also enlisted. 

Keywords. PMBLDC motor; sensors; controllers; sensorless operation; 
torque pulsations. 

1. Introduction 

Permanent Magnet Brushless DC (PMBLDC) motors are increasingly being used in a 
wide spectrum of applications such as domestic equipments, automobiles, information 
technology equipment, industries, public life appliances, transportation, aerospace, de¬ 
fence equipment, power tools, toys, vision and sound equipment and medical and health 
care equipment ranging from microwatts to megawatts It has become possible be¬ 
cause of their superior performance in terms of high efficiency, fast response, light weight, 
precise and accurate control, high reliability, maintenance free operation, brushless con¬ 
struction, high power density and reduced size. Recent developments in PMBLDC motor 
technology in terms of availability of high performance rare earth PM materials, vary¬ 
ing motor constructions such as axial field, radial field, package type, rectangular fed, 
sine fed motors, improved sensor technology, fast semiconductor modules, low cost high 
performance microelectronics devices, new control philosophy such as robust, adaptive, 
fuzzy, neurail AI based controllers, have been a boon to their widespread use in the large 
speed ranges from few revolutions to several thousand revolutions per minute (rpm). They 
have been proven most suitable for position control in machine tools, robotics and high 


precision servos, speed control and torque control in various industry and process control 
applications 7,8, 21-24 

Inspite of being one of the best, the PMBLDC motor has faced many hurdles to come 
to its present stage in terms of cost, torque ripples, noise, vibrations, reduced reliabil¬ 
ity due to the large number of components, operational constraints such as temperature 
rise etc. Continuous efforts have been made to overcome these problems on different 
aspects of this drive. The PMBLDC motor drive is undoubtedly quite a big mission in 
itself; this paper conceritrates on the recent advances in PMBLDC motors in terms of 
motor construction, closed loop controllers, semiconductor power modules, sensors and 
their reduction, torque ripple minimization, impact of microelectronics, cost reduction and 
potential applications. 

2. Latest developments in the PMBLDC motors 

Permanent magnet (PM) excitation has been used in place of dc excitation in different 
electric machines such as dc machines, synchronous machines and new PMBL machines 
such as PM stepper motors, hybrid stepper motors and PMBLDC motors. High cost of PM 
materials has been a major bottleneck for use and development of these electric machines. 
Gradual growth of better PM materials, improved manufacturing technology, varying na¬ 
ture of construction of these motors to suit specific applications have brought them at a 
level where they are considered one of the best motors available nowadays. PM machines 
have a wide spectrum but this paper is restricted to PMBLDC motors. 

Presently PM materials used in PMBLDC motors are classified in the following three 
broad categories namely Alnico (Al-Ni-Co-Fe), Ceramics also include ferrites and 
rare-earth materials such as samarium-cobalt (Sm-Co), neodymium-iron-boron (Nd-Fe-B ). 
Alnico and ferrites have long been used in the development of PM motors as they are cheap 
and easily available. Rare-earth PM materials, namely SmCo, are used nowadays because 
of the high energy density caused by its high residual flux density, coercive force and low 
temperature coefficient. NdFeB is considered one of the best PM materials presently since 
it offers much higher residual flux density and coercive force. However, its only drawback 
is the temperature limit. Continuous efforts are being made to overcome this and it is hoped 
that this will enable PMBLDC motors to attain higher efficiencies and lower sizes along 
with other advantages. 

PMBLDC motors may be classified into different categories such as number of phases, 
radial or axial field, cageless or with cage bars, surface mounted PMs or buried mag¬ 
nets, sinusoidal or rectangular fed motors etc. Some of them are briefly discussed in this 
section. 

2.1 Number of phases 

PMBLDC motors are developed in single phase in low power (< 50W) for tube axial 
fans to cool electronics equipments . They are manufactured in two phase construction 
for home appliances such as solar PV fed refngeration system, servo control etc. Most 
of the medium and high power rating motors are designed in three-phase construction 
similar to conventional ac motors. In some electric vehicles and megawatt rating 
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lotors for submarine propulsion etc., designers have compelling reasons to increase the 
jmber of phases to five, six or more in order to reduce the per phase power handling 
squirements. 

.2 Radial and axial field motors 

lost of the motors in the market are radial field type (cylindrical or salient pole construc- 
on). However, the axial field motors have some advantages over the conventional radial 
eld construction in terms of power density, torque to inertia ratio, peak torque, less magnet 
'eight, low inductance, short winding turns, compact design etc. Axial field motors 
re designed in package, disk and sandwich type constmction and have no iron in the rotor, 
^suiting in low inertia. Axially directed magnetic field from rotor magnet interacts with 
idially directed currents in these axial field motors. The magnets are encapsulated in resin 
r plastic. Because of their construction, they are considered most suitable for robotics, 
Dmputer equipments, machine tools etc. 

Radial field motors are eilso designed with varying desired flux linkage waveforms such 
5 sinusoidal or trapezoidal, different shapes and positions of magnets in the rotor such 
s buried or surface mounted etc. They are widely used since stator design is. similar to 
onventional ac synchronous or induction motors. Figure 1 shows the typical cross-sections 
f these two types of popular PMBLDC motors. 

.3 Shape and location of PM in rotors 

'ermanent magnets are placed in the rotor in PMBLDC motors In axial field type of 
lotors, the magnets are encapsulated in resin or plastic in disc form as shown in figure la. 
’hese magnets are placed in such a manner that induced back emf are either sinusoidal 
r trapezoidal waveforms. In radial field motors, the magnets are placed in different form 
uch as surface mounted for low speed motors and interior radially oriented or interior 
ingentially oriented in high speed PMBL motors. Figure 2 shows such rotor geometries, 
"hey are also designed to achieve sinusoidal or trapezoidal back emfs depending upon 
pplications. 

'A Sinusoidal and rectangular fed motors 

*MBLDC motors are designed to have either sinusoidal or trapezoidal (excited) induced 
»ack emfs 8>2,6-32 sinusoidal excited motors are fed with sinusoidal polyphase currents 
irailar to conventional synchronous motors for ripple-free torque with unity power factor 
or constant torque operation below base speed with frequency control and having lead- 
ng currents to affect field weakening for constant power operation. Maximum speed of 
)peration is restricted with demagnetization caused by armature reaction and mechanical 
:onstruction. Magnetic saliency on rotor with reluctance torque helps to achieve wide 
ipeed range of constant power operation. Trapezoidal excited motors need polyphase bal- 
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(a) (b) (c) 

Figure 2. Rotor geometries of PMBLDC motors, (a) Surface mounted, (b) interior radially- 
oriented, and (c) interior tangentially oriented magnets. 


these rectangular currents they are also called switched PM motors, brushless dc motors 
d electronic commutated PMBLDC motors. 

Figure 3 shows the ideal current waveforms for these two types of motors. Position 
nsors requirement is accordingly changed to realize these ideal current waveforms in the 
otor windings in self synchronous control mode. 

Closed loop controllers 

'espective of sinusoidal or trapezoidal excitation, PMBLDC motors are used for posi- 
m control, speed control and torque control in motion control applications Figure 4 
ows a typical position closed loop control with inner speed and current loops. For speed 
introl system outer position loop is not required and speed reference is the command 
pial. Torque control is incorporated in high performance motion control through closed 
op regulation of phase currents in synchronization with shaft position feedback. In the 
ajority of PMBLDC motors, torque is linearly related to currents and torque command 
aps into current commands with only a simple proportionality constant. In some typ- 
al cases, a nonlinear mapping is required between torque and current commands. The 
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Figure 4. Block diagram of closed loop position control with speed and current loops of 
of PMBLDC motor drive. 


constant power operation of PMBLDC motor drive is extended through field weakening 
control techniques which also requires another command signal for torque to current map¬ 
ping. Current regulation in the phase windings of PMBLDC motors either in sinusoidal 
or rectangular manner is carried out using current controlled voltage source inverter(VSI). 
PWM, hysteresis and predictive current controllers are used to issue the switching signals 
to the devices of the inverter to realize winding currents close to command currents. Speed 
control is generally achieved by using a speed feedback and speed command through speed 
controller which outputs a command signal for torque controller. Position control is imple¬ 
mented through position feedback and position command using position controller. The 
output of the position controller is the speed command for inner speed loop. Both position 
and speed closed loop controllers are realized using wide varying closed loop controllers 
such as PI (proportional-integral), PID (proportional-integral-derivative) SMC (sliding 
mode controller) adaptive controllers fiizzy based control and neural based 
controllers 

These classical (PI or PID) controllers or advanced closed loop controllers such as 
SMC, fuzzy and neural network-based ones are implemented using DSP, microcontroller 
and specific application integrated chip (ASIC) for speed and/or position control. Many 
manufacturers have developed ASICs for typical applications of PMBLDC motors. 


4. Recent developments in inverters and converters 

PMBLDC motors are invariably fed from variable frequency inverters to provide electronic 
commutation. At small ratings, MOSFET-based VSIs are used to achieve ideal current con¬ 
trol with reasonable high switching frequency. In medium rating drives, IGBT-based VSIs 
are used to feed PMBLDC motors. GTO-based inverters are used in high power rating PM- 
BLDCM drives because of their self-commutating feature and improved current control. 
In medium and high power rating PMBLDC motors, an input unity power factor rectifier 
with PWM current control is used to achieve regulated dc link and regenerative feature of 
the PMBLDC motor drive. Classical PI/PID and feedforward closed loop controllers are 
used for the control of front end converters. It improves the power quality of the ac mains 
in terms of reduced harmonics and unity power-factor in both the directions of power 
flow. 
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Recently MOSFET/IGBT-based VSIs are used in module forms to achieve compact 
>ign of the drive. Power MOS ICs are used to control small rating PMBLDC motors 
dch has integrated microelectronics for control and power amplifier to excite the motor. 
3 reover, their gate drives are also used in module form with all protective and intelligent 
itrol features. In some typical applications (residential commercial blower), they are 
ide so compact that these are fitted inside the motor housing 


Latest trends in sensors 

the control of PMBLDC motors, position, speed and current sensors are essentially re- 
ired to regulate the phase currents in synchronization with rotor position . Moreover, 
netimes, terminal voltage sensors are also required to estimate either position or speed. 
Itage sensors are also needed to regulate dc bus voltage during braking or for front end 
nverter control. In some typical attempts, flux sensors and torque sensors are also used 
the precise control of these motors. Basic role of these sensors are already discussed in 
)sed loop control of PMBLDCM drive in § 3. In the following section, the recent trends 
sensors and their function are briefly discussed. 

L Position sensors 

le rotor flux position in PMBLDC motors is defined by the mechanical angle of rotation, 
lich is achieved from some form of rotor position sensors. Rotor flux position is required 
' phase current synchronization and rotor position is also required for position control. 
)tor position is directly sensed using position sensors or indirectly estimated using other 
jasured parameters. Hence, rotor position sensing is indispensable in current-controlled 
4BLDC motor drives. 

Rotor position is sensed using resolvers, inductive modular absolute system (MAS) 

11 effect position sensor, magnetoresistors electronics and optical encoders, synchros 
d tachsyn The tachsyn is an airgap reluctance sensitive 3-phase alternator with PM 
Id and trapezoidal output waveforms. It is used for position and velocity sensing and 
jnal outputs are analog. They are available in 4,6 and 8 pole configurations. The encoders 
j characterised by number of pulses per revolutions (PPR) and nowadays they are avail- 
le in several thousands PPR. Interfacing ICs are also available to convert these sensor 
jnals to digital form to feed digital processors used for intelligent control of PMBLDCM 
ives. 

Indirect position sensing is achieved by estimating the rotor position using other mea- 
red parameters such as currents and voltages etc. There are many techniques for rotor 
IX position estimation which are much detailed in the next section. 

I Speed/velocity sensors 

the PMBLDC motor drive, speed or velocity signals are essentially required for speed 
introl loops in position controlled drives and speed feedback for speed controlled drives. 
)eed measurement is carried out either using speed transducers/sensors or estimated 


using the rotor position information either obtained through direct position sensing or 
through estimation. 

In general dc tachogenerators and brushless tachogenerators are used to sense motor 
speed. They provide an analog dc voltage signal which is proportional to shaft speed. 
The polarity of this voltage signal in both types of tachogenerators results in the direction 
of rotation. Nowadays rotor velocity/speed is estimated more accurately by using high 
resolution position sensors or estimated rotor flux position. Sometimes, these sensors are 
different from the position sensors used for electronic commutation. 

5.3 Current sensors 

Fast torque control in high performance PMBLDC motor drives is implemented through 
closed loop regulation of phase winding currents in synchronization with rotor flux position 
information. Closed loop regulation of winding currents is realized through PWM or 
hysteresis current controllers of CC-VSI over the reference desired currents and sensed 
winding currents. Therefore, the sensing of winding currents becomes indispensable in 
PMBLDC motor drives. 

The current sensing is generally carried out using hall effect current sensors. They detect 
the magnitude and direction of currents and are integrated to provide sensitive and accurate 
current sensing. Very fast response (less than 1 microsecond) and accurate current sensors 
are available from different manufacturers (ABB, LEM etc.) in wide range of current 
sensing (fractions of amperes to kiloamperes). These hall effect current transducers have 
galvanic isolation of several kilovolts which is a very desired requirement of these drives 
in high rating. Generally in 3-phase motors two current sensors are required and third- 
phase current is estimated from other two-phase currents in star connected motors. These 
current sensing requirements for current regulated rectangular fed PMBLDC motor drive 
are typically reduced to a single current sensor in the dc link of the inverter. Current shunt 
resistors with low power dissipation are often used as the current sensor in low power 
drives for cost effectiveness. In the modem power devices such as MOSFET/IGBT a 
current sensing feature is provided by many manufacturers which also dispenses with the 
use of extra current sensors in the control of the inverter/converter feeding the PMBLDCM. 

5.4 Voltage sensors 

Terminal voltage sensing in modem advanced PMBLDC motor drives is required to esti¬ 
mate the rotor position and speed for the control, resulting in mechanical sensorless drive 
with a view to reduce size, cost, maintenance and enhanced reliability. Voltage sensing is 
also required to regulate dc bus voltage during braking or for control of front end converter 
used for regenerative feature in high rating drives. 

Terminal voltage sensing is carried out by using electronic isolation amplifier (AD202 
Analog Devices make etc.) and hall effect voltage sensors (ABB make etc.) with galvanic 
isolation. In small rating drives, voltage is sensed using high valued resistor potential 
dividers to reduce the cost of the drive. Sometimes, induced voltage in the motor windings 
is achieved using special windings such as search coils etc. However, ac mains voltages 
for the control of ftont end converter are sensed using the potential transformers. 
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Sensors elimination and reduction 

:;ent trends in sensors, their requirement and types of available sensors are already 
cussed in § 5. However, some of these sensors in PMBLDC motor drive may be reduced 
m the view point of size, cost, maintenance and reliability. Typically mechanical rotor 
>ition and speed sensors have the drawbacks of increasing the number of connections 
ween motor and controller, increased interference, limitation in accuracy of sensors 
5 to environmental factors such as temperature, humidity, vibrations etc., increased 
:tion and inertia and additional space in motor housing. Because of these problems 
ently there has been wide interest and developments in the techniques for elimination 
mechanical rotor position/speed sensing by estimating the rotor position and speed 
ng sensed currents and voltages ^0-57 Moreover, the number of voltage and current 
isors may be reduced through using intelligent processors for the control of inverter 
ding PMBLDC motors. Various techniques for these sensors elimination and reduction 
briefly discussed in the following section. 

Mechanical sensors elimination 

le of the most recent developments in PMBLDC motor drives has been the rapid evolution 
new techniques for eliminating the rotor angular position sensor. Elimination of the 
ift-mounted position sensor is a very desirable feature in a number of applications since 
s sensor is one of the most expensive and fragile components in this drive. Some of the 
sition sensorless schemes are classified in brief. 

I Back EMF based position estimation 

e most common methods of rotor flux position sensing is based on deriving the back 
if signals. There are many methods for rotor position estimation based on the back emf 
d are briefly discussed below. 

la Direct back EMF detection: This method is quite popular for rectangular fed 
TBLDC motors. In these PM motors, particular phase winding is excited for 2/3 of each 
jctrical period and ideally there is always one phase which is not excited. The direct 
[ising of back emf of unexcited phases in sequence is used to generate discrete rotor 
sition signal for current synchronization with rotor flux. It has been applied to many 
lustrial applications including disk drives compact stereo players and room air 
nditioners . 


Zb Estimation of back EMF: This method is applied to both sinusoidal fed and rect- 
gular fed PMBLDC motors. This method is based on the reconstruction of the back 
if by using voltage equation of the motor {e = v — iR-Ldii/dt). Reconstruction of the 
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terminal voltage and line current are measured directly and the above equation is used to 
achieve back emf and rotor position. 

6.3 Third harmonic voltage detection based position estimation 

In star-connected PMBLDC motors, the third harmonic voltage is measured between the 
star point and an artificial star point created by three high-value resistors which are con¬ 
nected to the motor terminals. Such a voltage gives six zero crossings in the 3-phase motor 
and results in rotor position for current synchronization with rotor flux. 

6.4 By monitoring current or computation of the phase inductance 

The basic concept of this method is that the rate change of current in a phase winding 
of the motor depends on the incremental inductance which is rotor position dependent. 
This phase v/inding inductance variation with rotor position is used to estimate the rotor 
position for electronic commutation of the inverter. This current sensing model for rotor 
position estimation is used by Lin et al 

6.5 Injecting diagnosis signal to the stator winding 

This method uses a PWM carrier frequency and inductance bridge to measure the rotor flux 
path reluctance. The method operates on a bridge principle by monitoring the inductance 
difference in two phases, and is sensitive to small variations in reluctance. 

6.6 Observer methods 

In these methods, an observer reconstmcts the rotor position which is directly measurable. 
Basically all these methods use the sensed phase and/or line currents to perform on-line 
compensation to derive the rotor position. A number of observer methods such as Kalman 
filter technique, discrete time observer, state observer and stator flux estimation method 
current and the voltage based observer method as well as improved different types 
of motors, are reported. 

6.7 By a special windings electromagnetic devices 

A number of methods based on special winding such as search coils or an electromagnetic 
device are used to sense the rotor position. The electromagnetic device consists of pick¬ 
up coils around a special stator made of magnetically nonhnear material. The pick-up 
coils are excited by a high frequency sinusoidal current. The device detects the phase of 
the second harmonics component of the induced voltage in the pick-up coils. 

6.8 Monitoring switching states in the inverter 

This method is apphed to rectangular fed PMBLC motor and ON/OFF states of inverter 
switching devices are used for rotor position estimation The method is based on a 
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;or with trapezoidal back-emf. The rotor flux position is obtained on the basis of the 
ducting state of free-wheeling diodes in an open phase at a particular time, 
n most of these methods, the rotor speed/velocity is estimated by using the time deriva- 
of rotor position angle or by measurement of the period during the transition of alter- 
ng rotor position. 

Elimination/reduction of current and voltage sensors 

snsive work is carried out to reduce the current and voltage sensors in the PMBLC 
:or drive to reduce the cost and enhance reliability. Normally two current sensors are 
aired in 3-phase star connected motors, a technique used to estimate the 3-phase winding 
rents only using one current sensor in the dc link and switching states of the inverter 
ices. Since, in the intelligent inverter control, device-switching patterns are available 
he processor, the 3-phase winding currents signals are constmcted using measured dc 
: current and switching status of the inverter devices. Similarly, one voltage sensor is 
d at the dc link and three-phase terminal voltages are derived using the same switching 
es of the inverter devices. 

’or rectangular fed PMBLDC motor drives, curi'ent sensors can be eliminated entirely 
ig current sensors embedded in three of the six inverter switches MOS gated devices 
h as MOSFET and IGBTs in integrated modules from different manufacturers are 
ilable which incorporate current sensors integrated into monolithic power devices. 


Torque pulsations, noise, vibration and their reduction 

y divergence from ideal conditions either in the motor (design factors) or in the power 
erter feeding PMBLDC motor drive (current waveforms) results in undesired torque 
sations The torque pulsation in PMBLDCM drive causes speed oscillations, 
itation of resonances in mechanical portions of the drive causing acoustic noise and 
ible vibration patterns in high precision machines. Pulsating torque in PMBLDC motors 
asically in form cogging torque generated by interaction of the rotor magnetic flux with 
:or magnetic reluctance variation and ripple torque generated by the interaction of stator 
rentmmfs with rotor magnets. Ripple torque is due to mutual or alignment torque caused 
current mmf with rotor flux distribution and reluctance torque caused by current mmf 
h rotor reluctance variations. In rectangular fed PMBLDC motor, the fringing fields at 
rotor pole edges cause deviations from ideal trapezoidal emf waveform and currents 
also not strictly rectangular resulting in dip of the torque magnitude up to 25% of rated 
que 

Forque pulsations are minimized by two broadly classified techniques, namely improved 
tor designs and active control schemes. Improved design techniques for pulsating torque 
rimization include skewing, fractional slot winding, short pitch winding, increased 
nber of phases, airgap windings, adjusting stator slot opening and wedges rotor 
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However, active control techniques for pulsating torque minimization include adaptive 
control technique preprogrammed current waveform control, selective harmonics in¬ 
jection techniques, estimators and observers, speed loop disturbance rejection, high speed 
current regulators, commutation torque minimizations and automated self commis¬ 
sioning schemes 


8. Field weakening 

Field weakening of PMBLDC motors has become quite demanding for achieving high 
speed operation in constant power applications such as electric vehicles etc. For 
speeds above base speed, the rotational emf increases rapidly and constant power operation 
cannot be maintained at high speeds. The amplitude of back emf (line to line) increases 
linearly with speed and becomes more than the dc link voltage of the inverter. Eventually 
the current controller saturates losing its ability to force the desired reference currents into 
the motor phases. 

Reasonable torque production may be attained by advancing the phase angle of the 
excitation transition points relative to back emf. As this angle is advanced, cuiTent in on 
coming phase is given a controlled time interval to build up before the back emf increases 
and chokes off further current growth. This concept is illustrated through figure 5. Figure 
5a shows the concept of advance angle (o:) in back emf and current of a particular phase 
winding of rectangular fed PM motor for 2/3 period excitations (120°). However, this 
concept is also applicable for current excitation of 180°. The resulting effects of this 
advance angle excitation on torque-speed characteristics of the PMBLDCM are shown in 
figure 5b. This concept is used in both sinusoidal and rectangular fed motors to extend the 
speed range as constant power drive. This control aspect can give at the most, the speed 
range 3:1. This speed range may only be extended to 7:1 or more by proper designing of 
the motor such as axially laminated PM reluctance motor. 




Figure 5. (a) Typical back emf and phase current waveforms of rectangular fed PMBLDC 
motor drive with advance angle, (b) Typical torque-speed curves of rectangular fed PMBLDC 
motor drive with advance angle control. 
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9. Impact of microeiectronics on PMBLDC motor drive 

Improved and intelligent control techniques for the PMBLDCM drive are implemented 
through advanced technology in integrated electronic circuit chips ASICs (specific 
application integrated chips) are developed by many manufacturers namely LM621 by 
National Semiconductors Limited, (Si9985CY) by Silconix, SA/SE/NE5570 by Phillips 
etc. for the PMBLDCM drives. Moreover, sensorless operation of PMBLDC motors, adap¬ 
tive control, torque pulsation minimization etc are realized through micro controllers, DSP 
and single chip microcomputers. Reduced cost of manufacturing of VLSI has made it pos¬ 
sible to design dedicated ASICs for such complex and intelligent control and providing 
compact, reliable, low cost and effective controller for PMBLDCM drives For many 
dedicated applications, power and control are integrated in the same MOS ICs for low 
power PMBLDCM drives. 


10. Application issues of PMBLDC drives 

PMBLDC motors are attractive in drive applications which benefit from high efficiencies 
as compared to induction motor and switched reluctance motors. Sinusoidal-fed PMBLDC 
motors are considered superior to rectangular-fed PMBLDC motors for achieving the wide 
high speed constant power speed range The positive characteristics of PMBLDC motor 
discussed earlier make them highly attractive for a number of applications, a few of which 
are given here. 

10.1 Servo actuators 

High power density, fast dynamic response, smallest size and weight with rare earth magnet 
(Nd Fe-B or SmCo) of PMBLDC motors have made them the best suited candidates in 
machine tool servos, robotics actuators drives and dual tandem electromagnetic actuators 
(EHA)^^ 

10.2 Commercial residential speed control applications 

PMBLDC motors drives are used in a wide range of commercial and residential applications 
such as domestic appliances, heating, ventilating and air conditioning equipment due to 
their highest possible efficiencies. The speed control ability of compressors and blowers 
is able to provide operation at their high efficiency. The physical integration of controller 
electronics in the motor body itself is able to make them most suitable for low power (0.5 hp) 
blowers and low power (50 W) tube axial fans for cooling the electronics equipment 

10.3 Automotive applications 

The efficient variable speed feature with low volume of PMBLDC motors has made them 




capable of providing wide speed range constant power characteristics at high efficiency to 
match these automotive applications, 

10.4 High power industrial and propulsion drives 

High power (1.1 MW), six-phase PMBLDC motors are considered best-suited for direct 
drive marine propulsion application to replace the dc motor drive with a saving of 40% 
in weight, length and volume. PMBLDC motors upto 2 MW power rating and speed of 
4000 rpm are used in land and marine propulsion systems and industrial pumping. 

With high efficiency and high power density, the demand for PMBLDC motor drives is 
increasing and is expected to continue for high performance servos, commercial residential 
products, computer industry, information technology, propulsion systems, vehicle and 
traction. 

11. Conclusions 

The latest trend and present status of PMBLDC motor drives have revealed that this drive 
has great potential for use in a number of applications. The use of new PM materials, 
latest motor construction technology, intelligent closed-loop control, new types of invert¬ 
ers, sensorless operation, and field weakening have resulted in potential applications and 
provided a broad prospective to drive-design and application engineers. The methods of 
torque pulsation reduction have been explored to design PMBLDCM drives to suit highly 
position-sensitive applications. In view of these new developments, these drives are ex¬ 
pected to enjoy a bright future in a larger number of applications of adjustable speed 
applications requiring high efficiencies and fast dynamic responses such as blowers, com¬ 
pressors, automobiles, vehicle propulsion systems and traction drives. 
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Introduction 

wing to the advantages associated with ac motors, ac motor drives are fast replac- 
ig dc motor drives. Modem ac motor traction drives have several important features 
ich as regenerative braking capability, very little maintenance and down-time, near- 
P/» nearly sinusoidal current, good adhesion, high average speed and high 
?ficiency. 


Important features of electric traction drive (Dubey 1994, p. 327) 

0 For economic reasons, a 25 kV, 50 Hz, ac traction drive employs only single-phase 
supply although the rating of the locomotive is as high as 6000 hp. 

1) The traction supply system is weak in nature and hence the voltage is subject to 
fluctuations. 

V) As traction supply is weak in nature, the reactive power has a very adverse effect. It is 
essential that the power factor is not allowed to be lower than 0.8. 

!•) Use of electric brakes reduces wear and tear on the track, wheels and brake shoes, 
thereby increasing their life substantially. 




(6) Regenerative braking is generally combined with the dynamic braking. 

(7) Use of the controlled rectifiers results in the generation of harmonics, which are injected 

into the source. This results in the following adverse effects. 

(a) High frequency harmonics cause interference with the communication line. 

(b) Low frequency harmonics may enter into the track circuits, leading to the mal- 
operation of the signals. 

(c) They can cause sharp fluctuations in the supply voltage. 

3. Advantages of AC motor traction drives over DC motor traction drives 

Modem traction drives generally use three-phase induction motors as traction motors. The 

ac motor drive has the following advantages. 

(i) Three-phase induction motors are robust and have high torque-to-weight ratio. 

(ii) They have simplified and reduced maintenance because of the absence of the com¬ 
mutator and bmsh gear as in Jc motor drives. 

(iii) Full use of the available adhesion between wheel and rail is possible because of the 
naturally steep torque-speed characteristic of the induction motor. 

(iv) Induction motors have a good regeneration capability. 

(v) Due to absence of a commutator, the motor windings can be designed for higher volt¬ 
ages. This results in more favourable design of other components such as inverters, 
converters, transformer secondaries etc. 

(vi) Current rating of the induction motor is low. This results in reduction of cable size, 
number of contactors, number of switches etc. 

(vii) At the same power levels, induction motors are lighter than dc motors. This results in 
the relatively smaller unspmng mass of the truck giving good riding characteristics 
.and low rail stress. 

(viii) Three phase induction motors have improved efficiencies and reliabilities in opera¬ 
tion than do dc motors. 

(ix) With the three-phase drive, electric braking down to standstill is possible. 

4. Suitability of VSI and CSI drive for traction application (Dubey 1997) 

A three-phase AC induction motor should be fed from a three-phase supply capable of 

delivering steplessly variable frequency and voltage. The current source inverter and the 

voltage source inverter are available to meet this requirement. The important relative merits 

and demerits of CSI and VSI drives in relation to traction are as follows. 

(i) CSI drives do not suffer from the shoot-through fault which is common in VSI drives. 

(ii) Regenerative braking capability is inherent in CSI drives fed from a line-commutated 
fully controlled converter or a PWM ffiUy controlled converter and in VSI drives 
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powered from a synchronous link converter. If the ac supply fails, the regenerative 
braking will not be possible in both the drives. Under such conditions, a VSI drive 
can use dynamic braking but the same is not possible with a CSI drive. 

) The presence of a large value inductance in the dc link of a CSI drive results in the 
slower dynamic response compared to a PWM VSI drive. Consequently VSI drive 
has better adhesion (i.e. lower possibility of wheel slip). 

) When the source is dc, a PWM VSI drive will be cheaper compared to a CSI drive 
of the same rating. Also the requirement of the large commutation capacitors, and 
a large dc link inductor (which is over-sized to prevent saturation), the volume and 
weight of a CSI drive is much higher compared to PWM VSI drive. 

) The frequency range of the CSI is lower than that of the VSI drive. Hence the CSI 
drive has a lower speed range. 

) The CSI is not suitable for multimotor drives. Hence each motor is fed by its own 
inverter and rectifier. But a single diode bridge or a synchronous link converter can 
be used to feed a number of VSI motor systems. Alternatively a single VSI can feed 
a number of motors. 


AC motor traction drive 

Principle of operation 

e block diagram of a popular ac motor traction drive is shown in figure 1. 

The pantograph collects the power for the mnning locomotive from the overhead line, 
le pantograph is connected to the primary of the transformer. The isolation and protection 
vices are provided between the pantograph and transformer. 

The traction transformer has a single primary winding and multiple secondary windings 
: feeding traction converters. The traction transformer is so designed that the percentage 



Figure 1. Block diagram of a popular ac motor traction drive. 
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impedance of all the secondary windings feeding traction converters are maintained un¬ 
changed irrespective of the combined or isolated operation of the windings. 

Traction power converters consist of PWM converters. These converters are connected 
in series or parallel. Each unit is of modular construction with built-in cooling arrangement. 
Converters are operated in a way so as to maintain a power factor of unity throughout the 
operating range, with practically zero harmonic injection into the line. 

The dc link consists of a suitable bank of capacitors designed to provide a stable dc link 
voltage for feeding variable voltage variable frequency (VVVF) inverter in voltage source 
configuration. Alternatively dc link may consist of a suitable inductor designed to provide 
a stable dc link current for feeding CSI inverter in current source configuration. 

The dc link decouples the drive from the source. The inverter used may be CSI or VSI 
type. Generally CSI type inverter is used with synchronous motor drives and VSI type 
inverter is used with induction motor drives. The output of the inverter is connected to 
three-phase traction motors. The synchronous motors have higher full load efficiencies 
and power factors than induction motors. However, compared to squirrel cage induction 
motors they have higher cost, weight and volume for the same rating and require more 
maintenance. 

Generally, microprocessor-based control systems are used for control of converters, dc 
links, inverters, traction motors, braking and slip. The microprocessor also performs the 
task of fault diagnosis and display in addition to that of control. 

5.2 Control of three-phase ac motor traction drive 

Several drives employing squirrel cage induction motors and synchronous motors are in use 
for traction. Variable frequency control is used both for induction motors and synchronous 
motors. 

Modes of operation of an induction motor traction drive are shown in figure 2. This 
drive has received wide acceptance. 
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^hen the drive is in the motoring region, the characteristic curve has three regions 
lenkamp & Peak 1985; Dubey 1989). 

Constant torque region; 
constant power region; 
slip limited region. 


'Constant torque region: The constant torque region ranges from standstill to base 
;d {cob). In this region the inverter is operated to supply adjustable voltage and frequency 
le motor. The motor voltage is adjusted as speed (frequency) changes to maintain a 
stant flux density in the motor. The motor voltage therefore increases proportionally 
i speed (frequency). The frequency of the voltage induced in the rotor (slip frequency) 
eld constant. This produces nearly constant torque and, since the power output is 
3ortional to the speed, the power increases linearly with speed. At base speed (cob), the 
voltage reaches the maximum limit of the inverter. 


Constant power region: In this region, the inverter is operated to supply the adj ustable 
[uency to the motor. The voltage is no longer adjustable and the inverter is operated 
laximum voltage limit. The power output of the motor remains nearly constant. This 
ie of control is extended until point a)\, where the motor breakdown torque limit is 
;hed. 


I Slip limited region: In this region, the slip frequency is maintained constant and 
increase in speed is achieved by reducing the motor current. Here the motor current 
uces inversely with speed and the torque decreases inversely as speed squared. This 
racteristic is often referred to as the series motor characteristic. The torque produced 
his region is somewhat higher than that produced by a dc series motor. 

Vhen the drive is in the braking region, the characteristic curve has the following regions 
)peration (Plunkett & Plette 1977). 

Constant power region; 
constant torque region. 


Constant power region: In this region, drive power is held constant to match the 
erter maximum power capability. This mode is similar to the constant power mode 
ing motoring and slip frequency varies in proportion to the speed (frequency). 


Constant torque region: When the drive is braking, the constant torque region has 
) modes of operation. 

Model i - In this region, both the motor voltage and current vary approximately as the 
lare root of speed. Hence the power varies directly with speed, as the slip frequency is 
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Table 1. Enforced limits, for psophometric currents. 


Particular Enforced limits 


Psophometric current 

<10A 

DC component of current 

<4.7 A 

Second harmonic current 

Audio frequency component 

<8.7 A 

1650-1750 Hz 

1950-2050 Hz 

2250-2350 Hz 

2550-2650 Hz 

<400 mA amplitude 

High frequency component 

<270 mA amplitude 

5050-5100 Hz 



varied in direct proportion to the speed. The power fed back to the source is proportional 
to the speed, decreasing linearly with speed. 

(h) Mode 2 - In this region, the motor current is held constant as slip frequency is held con¬ 
stant. Motor voltage is adjusted as speed (frequency) falls to maintain a constant flux density 
in the motor. Motor voltage therefore reduces proportionally with speed (frequency). At 
very low speeds the electric braking is allowed to fade. 

5.3 Requirements ofac motor traction drives 

(i) The source side converter should ensure a power factor as near unity as possible. 

(ii) The converter should be designed such that the psophometric disturbance current is 
kept within an enforced limit (table 1). 

(iii) The converter should be designed such that no intolerable level of interference is 
caused to track circuit, signal or telecommunication equipment. 

(iv) The power converter/inverter should ensure four-quadrant operation, and regenerative 
braking should be available from maximum speed range up to standstill. 

(v) The power converter/inverter should ensure the full utilization of the available 
adhesion. 

(vii) The inverter should incorporate beatless control system to suppress beat phenomenon. 


6. Load-commutated inverter synchronous motor traction drives 

6.1 Basic principle (Dubey 1989, p. 416) 
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Figure 3. Load-commutated inverter synchronous motor drive. 


In a self-controlled synchronous motor drive, the following control strategies are gen- 
ally used. 

f) Constant margin angle control; 
li) constant lead angle control; 
li) constant no-load angle control. 

In all the above control strategies, the synchronous motor operates with a leading power 
ctor and the thyristors of load-side converters are commutated by the motor-induced 
)ltages, in the same way as the thyristors of line-commutated converters. Because the 
ing is synchronized with the machine-induced voltage, the machine always operates in 
:lf-control mode. The supply to source side converters may be dc or ac. 

A load-commutated inverter synchronous motor traction drive using thyristors is shown 
I figure 4. Here the supply is dc. The dc current source is formed by a two-quadrant 
lopper (quadrants I & VI), a dc link inductor and a current feedback. The load-side 
inverter is commutated by the back emf of the synchronous machine (Pearson & Sen 
?84). 

When the overhead catenary is ac, the scheme shown in figure 5 is used. Here the 
irrent source is formed by source side converter (a controlled rectifier), dc link inductor 
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Figure 5. Regenerative synchronous motor ac traction drive. 
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Figure 6. Regenerative induction motor drive using synchronous link converter. 


d the current loop. The load-side converter is operated in self-control mode and with 
ad commutation. 

2 Control of load commutated inverter synchronous motor drive 

hen the drive is in motoring, the source-side converter acts as a rectifier and the load-side 
nverter acts as an inverter, resulting in the forward motoring operation. In this mode of 
»eration the average value of Vi (figure 3) is negative, and fi is positive and hence the 
)wer flows from the dc link to the machine giving operation in quadrant I. 

For braking the drive, the load side converter is controlled such that the average Vi is 
(sitive. Since the direction of remains unchanged, the power flows from the motor to 
Qdc link. In this mode of operation, the load-side converter works as a rectifier and the 
achine regenerates. The source-side converter is operated to feed the energy back to the 
urce. 

For reversing the direction of the drive, the phase sequence of the load-side converter 
reversed by interchanging the control signals between switches of any two legs of the 
verter. 

In the case of the load-commutated synchronous motor drive, at low speeds the induced 
nf will be insufficient to commutate the thyristors of the load-side converter. Therefore, 
start and for speeds below 10% of base speed, the commutation of the load-side converter 
yristor is done by forcing the current of conducting thyristors to zero, with the help of 
e source-side converter (Peterson & Frank 1972). 

Alternatively, a synchronous motor can be started by employing a simple forced- 
>mmutation circuit utilizing a single commutation capacitor and two auxiliary thyris- 
rs for the entire inverter (Steigerwald & Lipo 1979). 

VSI-squirrel cage induction motor drive 

1 Basic principle 

regenerative induction motor drive using a synchronous link converter is shown in 
Jure 6. The synchronous link converter permits the realization of an economic regenera¬ 
te a c drive with a power factor of nearly unity and a low harmonic cuiTent in the source 
irrent. The converter circuit is shown in figure 7. 
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Figure 7. Single phase synchronous link 
converter. 


When the single-phase voltage source inverter is operated with pulse-width modulation, 
it produces the voltage Vi. If the voltage Vd is maintained constant, the magnitude of V/ 
may be varied by adjustment of the modulation index of the converters. The phase of V/ 
w.f.t. Vs can be altered by changing the phase of firing pulses of the inverter, w.r.t. the 
phase of Vs. By appropriately choosing the magnitude and phase of Vi for a given Is, the 
phase angle between Vs and 4 can be adjusted to a desired value. 

Figure 8 shows the phasor relationship of various vectors when the phase angles between 
the source voltage Vs and source current fs are 0° and 180° respectively. In either case 
the power factor is unity. When the phase angle is zero, the converter works as a rectifier 
transferring power from ac source to the c/c link. In this mode of operation, current Id 
has a positive direction. When the phase angle is 180° current Id reverses and the power 
flows from the dc Link to the source and the converter works as an inverter. Because of 
the operation of the converter with pulsewidth modulation, the source current has a low 
harmonic content. 

Figure 9 shows a three-phase regenerative traction drive using a synchronous link con¬ 
verter and PWM inverter. Here two synchronous link converters are connected in parallel 
feeding an inverter, which in turn feeds three induction motors connected in parallel. Two 
such units are fed from a common transformer. 

Depending upon the power level IGBTs/GTOs can be used in the construction of a 
synchronous link converter. In case of Electrical Multiple Units (EMU), IGBTs can be 
used both in synchronous link converter and inverter. Since the IGBTs can be operated at a 
frequency of 2.5 kHz the harmonics in the source current can be reduced to a smaller value. 
We can use sinusoidal PWM, current controlled PWM or vector PWM for both converter 
and inverter. In case of a locomotive, because of the large power levels, GTOs have to be 
used. Since at high power levels, GTOs are operated around 400 Hz, generally sinusoidal 
PWM is used and more than one converter is operated in parallel to keep the harmonics in 
the source current within acceptable limits. 


Vi 






_ ^woniaci wire 

Pantograph 



Figure 9. Regenerative traction drive using synchronous link converter and PWM VSI. 


7.2 Control of VSI - Squirrel-cage induction motor traction drive 

The modem VSI squirrel-cage induction motor traction drives are generally torque- 
controlled drives. The drive is started with reduced voltage and frequency. This frequency 
reduction improves the motor power factor and increases the torque per ampere at the start. 
Hence the rated torque is available at the start, and the drive is accelerated to its operating 
speed. The drive is operated as described in § 5.2. Here the power is transferred from the 
source to the load. 

For transferring the operation from motoring to regenerative braking, the inverter fre¬ 
quency is reduced. A reduction of inverter frequency makes the synchronous speed less 
than the motor speed and transfers operation from quadrant I to quadrant II. Here the power 
flows from motor to dc link and from dc link back to source. The inverter voltage and 
frequency are reduced to brake the machine to zero speed. 
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VI 



Figure 10. Closed loop control around 
a synchronous link converter. 


For speed reversal, the phase sequence of the inverter voltage is reversed by interchang¬ 
ing the control signals between the switches of any two legs of the inverter. In the scheme 
described above, when the source is unable to take back the regenerated power, a dynamic 
braking scheme can be used. 

For this a braking resistor is connected in series with a switch across thedc link capacitor 
‘C’. The generated power charges the filter capacitor and its voltage rises. When the filter 
capacitor voltage reaches a prescribed maximum value, the switch is turned on, connecting 
the braking resistor across the capacitor. The energy generated and the energy supplied 
by the filter capacitor are dissipated in the braking resistor. The capacitor voltage falls. 
When the capacitor voltage reaches a prescribed minimum value the switch is turned off. 
Again the capacitor voltage starts rising and the cycle repeats. Thus only that portion of 
the regenerated energy is dissipated by dynamic braking which cannot be accepted by the 
source. 

The following control techniques are generally used with synchronous link converter. 

(1) Current-controlled PWM; 

(2) Indirect current control (sinusoidal pulsewidth modulation and its modifications). 
With the above mentioned current controlled techniques, for a given value of reference, 

the power input to the converter has a fixed value. When the load on the converter is 
decreased, there is imbalance between the power input and the power output. Hence a 
closed loop voltage control is provided across the synchronous link converter as shown in 
figure 10. 


8. Harmonic reduction and control of torque pulsation 

The line side converter generates current harmonics which are injected into the overhead 
lines. Hence, measures are taken to keep the harmonics below a desired limit. Some of the 
methods used with VSI squirrel cage induction motor drive are as below. 

(i) By using series/parallel connection of converters, the supply current harmonics can 
be reduced considerably. If N converters are put in series/parallel and if the carrier 
signals of individual converters are equally displaced by an angle n/N, then some 
of the reflected harmonics cancel on the primary side of the transformer. The ripple 
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frequency of the primary current has a frequency {N x fc), where N is the number 
of converter put in parallel/series and /c is the carrier signal frequency. 

For example, if the synchronous link converter is realized by using two GTO con¬ 
verters, say converter I and converter II in parallel, then the firing of the GTOs in 
converter I and converter II is done by using a common modulating signal and the 
triangular carrier wave phase shifted from each by an angle 90° (because = 2) as 
shown in figure 11. 

l) By using three-level converters as compared to two-level converters. 

i) By employing switched electronic compensator which generates an exact replica of 
the harmonic current. It is fed into an extra winding of traction transformer to produce 
harmonic counter emf. This ensures that the transformer main flux is sinusoidal, and 
hence so is the induced voltage in the primary. The line current is low-pass filtered 
by the transformer leakage inductance and assumes a sinusoidal waveform. 

For reducing the harmonics in the inverter output voltage and hence to minimize the 
:que pulsations, the inverters can be connected in parallel. They share the common dc 
k and the ac output terminals of the respective phases are connected through an inter- 
ase reactor. The mid-point of the inter-phase reactor is connected to the respective phases 
an induction motor. The inverters are operated with common modulating signals, but 
th the carrier signals of individual inverters being equally displaced within the carrier 
riod interval. 

Advantages of VSI squirrel-cage induction motor traction drive 

le VSI squirrel-cage induction motor traction drive has the following advantages. 

(i) Power factor of unity. 

(ii) Low harmonic content in the source current. 

iii) Regenerative braking capability at low cost. 

iv) Smooth acceleration due to absence of low speed torque pulsations. 

(v) Efficient and quiet operation due to low harmonic content in the motor input voltage. 
Vi) Good adhesion due to fast dynamic response and absence of torque pulsations, 
vii) Multimotor drive possible due to use of VSI. 
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(viii) Regenerative braking capability offering energy saving. 

(ix) Dynamic braking can be resorted in the event of catenary failure due to use of VSI. 

(x) Improved energy efficiency. 

(xi) Reduced maintenance cost. 

(xii) Reduction in the size and weight of traction circuit equipment. 

(xiii) Possibility of reducing the capacity of the traction substation, because the power 
factor of the system is always unity, both in powering and braking. 

10. Conclusions 

AC motor traction drives are superior to DC motor traction drives. Because of recent ad¬ 
vances in power electi-onics, drive control techniques and availability of micro-controllers 
for control and fault diagnosis have enabled the realization of traction drives with high 
efficiencies, high levels of performance and low down-times. 
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